A KAM ALGORITHM FOR THE RESONANT NON LINEAR 
SCHRODINGER EQUATION 
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Abstract. In this note we use the normal forms of the completely resonant non-linear 
Schrodinger equation on a torus (NLS) derived in |15| in order to produce, under a 
KAM algorithm, large families of stable and unstable quasi periodic solutions for the 
NLS in any number of independent frequencies. 
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1. Introduction 

The present paper is devoted to the construction of stable and unstable quasi-periodic 
solutions for the completely resonant cubic NLS equation on a torus T d : 

(1) iu t - Au = n\u\ 2 u + duG{\u\ 2 ), q > 1 £ N. 

Here u :— u(t, ip), ip G T d and G(a) is a real analytic function whose Taylor series starts 
from degree 3. 

Our results are proved by exploiting the Hamiltonian structure of equation and ap- 
plying a KAM algorithm. As is well known such algorithms require strong non-degeneracy 
conditions which are not always valid even for finite dimensional systems and, when valid, 
are generally proved by performing on the Hamiltonian a few steps of Birkhoff normal 
form. This is done in [T5] and |17) in which we exhibit and study the normal forms for 
classes of completely resonant non-linear Schrodinger equations. 

In [15] we study the NLS equation after one step of Birkhoff normal form and give 
"genericity conditions" on the tangential sites in order to make the normal form as simple 
as possible. We recall that the tangential sites are a subset S C 7L d such that the Birkhoff 
normal form Hamiltonian admits quasi-periodic solutions which excite only the modes 

The main results of |15) are formulated in Theorem [I] where we prove that one can 
choose symplectic coordinates in which the normal form is integrable. On the tangen- 
tial variables the normal form is non-degenerate and the motion is quasi-periodic with 
frequency cj = where is a diffeomorphism and £ € K" are free parameters 

modulating the initial data. Moreover, in the "normal variables," the normal form is a 
block diagonal quadratic form, with finitely many blocks of dimension at most 2d and the 
remaining blocks are self-adjoint and of dimension at most d + 1. All blocks have con- 
stant coefficients. These infinitely many blocks are explicitely described by a graph T$ (cf. 



£4.1 1 which contains all the combinatorial difficulties of the structure. This combinatorial 
structure will influence the KAM-algorithm presented in this paper. 

In [17] we address the delicate question of the non- degeneracy of the normal form 
deduced in |15j . we obtain precise results for q = 1. 

In this paper we address the question of constructing quasi-periodic solutions and 
present a general solution. We need to analyze three issues 
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i) The second Melnikov non-degeneracy condition. This we prove by using the 
results of [17] . 

ii) The Toplitz-Lipschitz (cf. 0) or quasi-Toplitz property of the perturbation. 
This is done by generalizing the quasi-Toplitz functions of |22| to this context; 
in particular we need to prove that the changes of variables that we perform to 
integrate the normal form do not destroy the quasi-Toplitz structure. 

iii) The KAM algorithm. This is a variation (with some complications) of a well 
established path; we follow closely the structure of [T] and of [22] . 

In all these steps we need to combine the analysis of [22] with the special structure of 
the graph Tg. This is the source of most of the specific problems for the NLS which make 
this case particularly complex. 

The final result will be the construction for any integer n of families of linearly stable 
(and also elliptic) quasi-periodic solutions for the cubic NLS with n frequencies and 
depending on n parameters varying in a Cantor set of positive measure. The conclusive 
result of this analysis is given in Theorem [7j 

Theorem. Given any set of n generic frequencies S and the corresponding solutions of 
the resonant Hamiltonian depending on n parameters £j we have a Cantor set of positive 
measure in the parameter space for which these solutions persist giving a family of solutions 
of the NLS which are also linearly stable. 

In a non empty open set of this Cantor set the solutions are also elliptic. 

This will be proved by verifying that the NLS Hamiltonian can be brought into a normal 
form which satisfies the properties of an abstract KAM Theorem, Theorem [5] 

Most of the properties necessary for Theorem [5] have been verified for the NLS in 
Theorem 1 of [15], here we have to prove the quasi-Toplitz property of the NLS, cf. { 11 

It is possible to perform a KAM algorithm for any value of q obtaining a weaker result. 
In this case one has property ii) only in block form. 

1.1. Some related literature. The idea of choosing an appropriate set of tangential 
sites S was first used by Bourgain in [5] in a slightly different context. He studied the 
cubic NLS in dimension two and proved the existence of quasi-periodic solutions with 
two frequencies by using a combination of Lyapunov-Schmidt reduction tecniques and a 
Nash-Moser algorithm to solve the small divisor problem (the so-called Craig-Wayne- 
Bourgain approach, see [5] ,[5] and for a recent generalization also [1]). In [5] it is shown 
that, for appropriate choices of the tangential sites, one may find simple solutions for the 
bifurcation equation where only the Fourier indexes of the tangential sites are excited. 
This strategy was generalized by Wang in [T5] to study the NLS on a torus T" and prove 
existence of quasi periodic solutions with n frequencies. A similar idea was exploited in 
[TT] and [TU] to look for "wave packet" periodic solutions (i.e. periodic solutions which at 
leading order excite an arbitrarily large number of "tangential sites" ) of the cubic NLS in 
any dimension both in the case of periodic and Dirichlet boundary conditions. 

In the context of KAM theory and normal form, this idea was used by Ceng in [5] for 
the NLS in dimension one with the nonlinearity |tt| u. 

A similar strategy is used by Geng-You and Xu in [5], to study the cubic NLS in 
dimension two. In that paper the authors show that one may give constraints on the 
tangential sites so that the normal form is non-integrable (i.e. it depends explicitly on 
the angle variables) but block diagonal with blocks of dimension 2. They apply this result 
to perform a KAM algorithm and prove existence (but not stability) of quasi-periodic 
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solutions. We also mention the paper |16j . which studies the non-local NLS and the beam 
equation both for periodic and Dirichlet boundary conditions. 

One of the main points of this paper is to prove the Quasi-Toplitz property for the 
NLS. This idea is introduced by Procesi-Xu [52], as a development of the Toplitz-Lipshitz 
property introduced by Eliasson-Kuksin in their study of the NLS with Fourier multipliers 
[7], which has been the first general existence result. 

Finally we mention the preprints by Wang [19] and [20] . which use the Craig-Wayne- 



Bourgain approach to study quasi-periodic solutions for the NLS (9.2) in any dimension. 



1.1.1. The plan of the paper. The paper is divided into four parts. In the first Part we 
need to recall all the properties of the normal form proved in [T3] and [T7] which will be 
needed. In Part two we start by recalling the formalism of cuts and optimal presentations 
developed in [22 and prove that this formalism is compatible with the structure of the 
graph Tg. Having done this we proceed to define quasi-Toplitz functions in our context 
and prove their basic properties. Parts 3 and 4 are devoted to the KAM algorithm. In 
Part 3 we discuss the general properties of the type of algorithm that we shall apply to 
the NLS while in the final Part 4 we verify that the NLS satisfies all the properties of the 
class of Hamiltonians studied in Part 3. We can finally conclude that the KAM algorithm, 
applied to the Hamiltonian of the NLS starting from the normal form described in Part 
1, leads to a successful construction of a family of quasi-periodic solutions of the NLS 
parametrized by a set of positive measures of the parameters £i, actions of the initial 
excited frequencies. One can discuss also which solutions are stable or unstable. 

Part 1. The normal form 

2. Summary of results from [TS] 

2.0.2. The Hamiltonian. In |15j we have studied the NLS on T d as an infinite dimensional 
Hamiltonian system. After rescaling and passing to Fourier representation^] 

(2) u{t,y) := Mty^ 



the Hamiltonian is (having normalized n): 

(3) H := ^ \k\ 2 u k u k + ^ u kl u k2 Uk 3 u k4 . 

The complex symplectic form is i du k A du k , on the scale of complex Hilbert spaces 

(4) := {u = {u k } k&d | K| 2 + K| 2 e 2a l fe l|fc| 2 " := \\u\\l p < oo}, 

fcez d 
a > 0, p > d/2. 

We systematically apply the fact that we have d+1 conserved quantities: the n-vector 
momentum M and the scalar mass L: 

M:=^% fe | 2 , L:=J]K| 2 , 

k 

with 

(5) {M, Uh] = ihuh, {M, u h } = ~\hu h , {L, u h } = iu h , {1,%} = -% 



4n fact one should work in a slightly more general setting where the torus is the quotient of M. d by 
any lattice A of finite index in X d and write u(t, tp) := X]fcgA* u k {t)e- l ^ k ' v ^ ■ 
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The terms in equation ^ commute with L. The conservation of momentum is expressed 
by the constraints = 0- 

2.0.3. Choice of the tangential sites. If in the Hamiltonian H we remove all quartic terms 
which do not Posisson commute with the quadratic part, we obtain a simplified Hamil- 
tonian denoted H res . This has the property that its Hamiltonian vector field is tangent 
to infinitely many subspaces obtained by setting some of the coordinates equal to (cf. 
|15) . Proposition 1). On infinitely many of them furthermore the restricted system is 
completely integrable, thus the next step consists in choosing such a subset S which, for 
obvious reasons, is called of tangential sites. Without loss of generality one may assume 
that S spans Z d over Z (cf. the footnote). 

With this remark in mind we partition 

(6) Z d = SUS c , S:=(ji,...,j„) 

where the elements of S play the role of tangential sites and of S c the normal sites. We 
divide u <E £ a,p in two components u = (li^tta), where u% has indexes in S and U2 in 
S c . The choice of S is subject to several constraints which make it generic and which 
are fully discussed in [T5] and finally refined in [17]. Here we shall always assume that 
these constraints are valid so we just refer to the results of these two papers in all the 
statements. 

We often use the map n : R™ — > R d , 1 7r(ai, . . . , a n ) := J^i a iji> notice that it maps Z™ 
to Z d , and set 

(7) K:=max|j|. 

If we use on R" the L 1 norm then n is also the norm of the map n. 

We apply a standard semi-normal form change of variables with generating function: 



(8) FBirk — — i ^ 



2\ /2\ u a ut 



c.3e(Z" )H; | Q | = | /3 | = 2,| Q2 | + | / 3 2 |<2 
Efc("fc-/3fc)fc = °- Efc(°fc-/3fc)l'=l 2 ^0 



The change of variables by := e ad ^ FBirk ^ is well defined and analytic: B tQ x B t0 — > 
B2e x B2 eo , for £o small enough, see [15]. By construction brings ^ to the form 
H = H Birk + P 4 + P 6 (u) where P 4 is of degree 4 but at least cubic in u 2 while P 6 (u) is 
analytic of degree at least 6 in u, finally 



(9) H Blrk :=J2\k\ 2 u k u k + 



u a u p . 



fcSZ d Q,^e(Z' i ) N :|Q| = |/3|=2 , |c 2 | + |/3 2 |<2 X 

Efc(« fc - ( 3 fc )fc = 0, E fc (a fc -^)|fc| 3 =0 



The three constraints in the second summand of the previous formula express the 
conservation of L, M and of the quadratic energy 

K := ^ \k\ 2 u k u k . 

k& d 



In order to perform perturbation theory from the system given by the tangential sites 
it is convenient to switch to polar coordinates. We set 

(10) u k := z k for k € S c , u u := = ^(1 + £- + .. -)e ix > for i = 1, . . . n, 

"Si 
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considering the £j > as parameters \yi\ < £j while y,x,w := (z,z) are dynamical 
variables. We denote by £^ a ^ := (Sg' p > the subspace of Z( a > p ) x f( a -P) of the sequences 
iti, Ui with indices in S c and denote the coordinates w = (z, z). For all e > and for all £ 

(11) £e^ e2 :={£ : ie 2 <&<£*}, 



(2! 

Formula (10) is a well known analytic and symplectic change of variables VlQ in the 
domain 

(12) D a . p (s,r) = D(s,r) := {x,y,w : x G T™ , |y| < r 2 , ||«,|| 0iP < r} C T£ x C" x £^ . 

Here e > 0, s > and < r < e/2 arc auxiliary parameters. T™ denotes the open subset 
of the complex torus TJi := C n /27rZ™ where x € C™, |Im(x)| < s. Moreover if 

(13) VMmax(|j l |) p e (5+amax(ljl|)) e < e , 

the change of variables sends D(s, r) — > B eo so we can apply it to our Hamiltonian. 



We thus assume that the parameters s, r, s satisfy (13). Formula (10) puts in action 
angle variables (y,x) — (j/i, . . . , y n ; x\, . . . , x n ) the tangential sites, close to the action 
f = £i, . . . which are parameters for the system. 

The symplectic form is now dy A dx + i J2keS c ^ Zk ^ ^Zk- 

We give degree to the angles x, 2 to y and 1 to w. We use the degree only for handling 
dynamical variables, as follows. We develop in Taylor expansion, in particular since y is 
small with respect to £ we develop + yi = + w: + • • •) as a series in I 1 . 

Definition 2.1. We define the normal form Af which collects all the terms of Hsirk of 
degree < 2 (dropping the constant terms). We then set P = H — Af. 

Notice that the Hamiltonian Hsirk is different from the corresponding one in |15j (in 
that paper we performed a full normal form transformation), however the resulting normal 
form Af is the same since it collects only terms of degree less or equal to two in the variables 
z = u 2 - 

3. FUNCTIONAL SETTING 

Following P3] we study regular functions F : D a ^ p (s,r) x A E 2 — > C, that is whose 
Hamiltonian vector field X F (-;£) is M-analytic from D(s,r) -> C" x C" x tf ■ In the 
variables £ we require Lipschitz regularity. Let us recall the definitions of M-analytic and 
majorant norm (first introduced by Nikolenko, cf. |13j ) and their properties proved in [3]. 

Let us consider the space 
(14) V := C n xC"x tg p 

with (s, r)-weighted norm 

'wli , Ml-* , \\4 



(15) v = (x, y,z,z) £V , \\v\\ v := |M| s , r = |M|y. s ,r = 1 5- H + 

s r- r r 

where < s < 1, < r < e/2 and \x\oo := maxh=i,..., n \x h \, \y\i := J2h=i \Uh\- 

For a vector field, i.e. a map X : D(s, r) — > V, described by the formal Taylor expan- 
sion: 

X = E X ti a ^ iKx) V l ^d^ v = x,y,z,z 
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we define the majorant and its norm: 

MX:= \ X k,L^ slkl fz a ^d Vl v = x,y,z,z 

(16) \\X\\ s<r := sup \\MX\\ V . 

(y,2,z)£-D(s,r) 

The different weights ensure that, if ||Xi?|| SiJ . < \, then F generates a close-to-identity 



symplectic change of variables from D(s/2,r/2) £>(s,r), Proposition 3.3 



Remark 3.1. The notion of M-analytic can be given in general for any map between 
separable Hilbert spaces with prescribed bases. It means that the map given in coordinates 
by the corresponding majorant functions is in fact analytic. It is then easy to see that 
composition of M-analytic maps is M-analytic with the corresponding estimate on norms. 

In our algorithm we deal with functions which depend in a Lipschitz way on some 
parameters £ in a compact set O C A £ 2 . To handle this dependence we introduce weighted 
Lipschitz norms for a map X : D(s,r) xO->y setting: 

\\X II'* sun \\X(r,)-X(Z)\\s,r 

W X f\\s,r,0 — SU P \Z J\ > 

f^JjGO, {x,y,w)£D{s,r) W ~ 5 1 

(17) ||*|| s ,r,e> = ||*IUr := sup \\MX\\ v ,\\X\\ x sr = \\X\\ s ^ + \\\X f \\f r0 

D(s,r)xO 

where A is a parameter proportional to e 2 . Correspondingly for a parameter dependent 
sequence / = {f m {£,)}mei , here / is any index set, we define: 

(18) |/U:=supsup|/ m (OI, \ft p := sup sup l/m f } - fMl , 

Definition 3.2. We define by T-L s , r the space of regular analytic Hamiltonians with the 
norrr0 

(19) \\F\\ 



A ._ II y- iiA 
s,r • II lls.r 



We denote by I = Z" x N" x N z< * x N zd the indexing set of the monomials and for 
all J c I, that is k,i,a,f3 is associated to e l ^ k,x ^y l z a z^ . We define the projection 11/ as 
the linear operator which acts as the identity on the monomials associated to / and zero 
otherwise. In particular we define IIi^x to be the projection relative to the set I of 
k,i,a,f3 with |fc| < K, same for ±l| fe |>/f, similarly we define as the projection on the 
k,i,a,f3 with 2i + \a\ + |/3| = £, same for and U ( ^ e \ 

The main properties of the majorant norm are contained in the following statements, 
proved in j£, Lemma 2.10, 2.15, 2.17. 

Proposition 3.3. Let H,K £ H s , r - Then, for all r/2 < r' <r, s/2 < s 1 < s, A' < X: 

(20) \\Xn\\$,r> <M\X H \\ir, 

(21) \\X {H , K} \\$ ir , = \\[X H ,X K ]\\$ >rl < 2 2 »+ 3 5- 1 ||^||^||^||^ 
where S is defined by: 

(22) 5 :=min(l- -,1- —\. 

I s r 1 

2 in fact Hamiltonians should be considered up to scalar summands and then this is actually a norm 
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Let r/2 < r' < r, s/2 < s' < s, and F g H s , r with 
(23) \\X F \\^ r <5/(2 2n + 6 e) 



with S defined in (22 I. Then the time 1-Hamiltonian flow 

$i : = e adF : £>(s',r') -> D(s,r) 
is well defined, analytic, symplectic, and, \/H G "H Si r, we have Ho <£>^, g T-L s 'y an d 

(24) ||X Ho$ i,|| s , )r , < 2||Xff|| S)r , \\X Ho $i f - X H \\ s , r , < 2||Xir|| sr ||XH || 8)r . 
For all I d I we have 

(25) W^XfW^K \\X f \\* r . 
In particular we have the smoothing estimates: s < s, 

(26) \\n lkl > K X f \\^ r <^ e - K ^\\X f \\i r , 
and the degree estimates 

(27) \\n^x f \\^ r ,<(^f-^x f \\l r , 

Remark 3.4. For a diagonal quadratic Hamiltonian F = ^ m r &m(£,) z mZ m . we have 

oz m m OZ m 

MX F = l^m(OI(«n» o— + ~^), II^fH^ = |l?|oo + A|0|S*. 
OZm OZ m 



Lemma 3.5. For e/2 > r > e 3 , the perturbation P of Definition 2.1 is in and 
satisfies the bounds 

(28) WXpW^KCier + e'r- 1 ), 
where C does not depend on e,r. 

Proof. This is item iv) of Theorem 1 of [TS]. The fact that we are using the majorant 
norm only changes the constant and not the order of magnitude. □ 

4. THE NORMAL FORM 

We will work with many quadratic Hamiltonians in the variables w. We represent a 
quadratic form J 7 by a matrix F as 

(29) F(w) = -(w,wJF t ) = -~wFJw t , 

which expresses the action by Poisson bracket. 

By explicit computation, and under simple genericity conditions, the normal form Af 
is as follows: 

(30) (w(0»»)+ E \k\ 2 \z k \ 2 + Q(t;x,w), u^) = \U\ 2 - 2£i 

here Q(£; x, w) is a quadratic Hamiltonian in the variables w with coefficients trigonomet- 
ric polynomials in x, see Formula (30) of [15 : 

(31) Q(M=A J2 VM^e^ Xi -^z h z k + 
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l<i<j<m. l<i<j<r. 
h,k£S c h,k£S c 



Here ^* denotes that (h,k,Vi,Vj) satisfy: 

{(h, k, v u Vj) \h + Vi = k + Vj, \h\ 2 + \ Vi \ 2 = \k\ 2 + \ Vj \ 2 }. 
and ^2* , that (h,Vi,k,Vj) satisfy: 

{(h,Vi,k,Vj) \h + k = Vi+Vj, \h\ 2 + \k\ 2 = \ Vi \ 2 + \ Vj \ 2 }. 
Notice that in the sums ^** each term appears twice. 

This is a very complicated infinite dimensional quadratic Hamiltonian, one needs to de- 
compose this infinite dimensional system into infinitely many decoupled finite dimensional 
systems. These blocks correspond to the components of the geometric graph which is re- 



called in \ 4.1 One of the main results of [15] is the construction of an explicit symplectic 
change of variables which reduces N to constant coefficients. The final form obtained in 
[15] is thus 

* 

(32) SH=4 £ + 

l<i^j<™ 



+2 V&044 + 2 VWjz' h z, 

l<i<j<m l<i<j<m 
h,k<=S c h,k£S c 



k- 



Since this is needed we recall quickly Theorem [T] of [15] adapted to the case of the 
cubic NLS. 

Theorem 1. For all generic choices S = {ji, • • • i jn} € of the tangential sites, there 
exists a map 

S c 3 k -> L(k) GZ" , \L(k)\<2d 
such that the analytic symplectic change of variables: 

_ p -iL(k).x 7 i ^ 

$ : (y',x) x {z',z') -> (y,x) x (z,z) 

from D(s,r) — ¥ D(s,r/2) has the property that J\f in the new variables has constant 
coefficients, namely: 

(33) Mo $ = (cj(0,y') + J2 ^W k \ 2 + 2K) . 

kes c 



z k = e ^"■'■-z , k , y = y' + ^ L{k)\z' k \ 2 , x = x' . 



where w(£) is defined in (30) and furthermore: 

i) Asymptotic of the normal frequencies: We have &k = \k\ 2 + X)i |ji| 2 -^ (^)- 

ii) Reducibility: The matrix Q(£) which represents the quadratic form Q(£, u/) (see for- 
mula (29)) depends only on the variables £ and all its entries are homogeneous of degree 
one in these variables. It is block-diagonal with blocks of dimension < d + 1 and satisfies 
the following properties: 

All of the blocks except a finite number are self adjoint. 

All the (infinitely many) blocks are chosen from a finite list of matrices 
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Hi) Smallness: If e 3 < r < e/2, the perturbation P :— P o <E> is small, more precisely we 
have the bounds: 

(34) WXpWl^Cier + e'r- 1 ), 

where C is independent ofr,e. 

The smallness condition impies that, if r is of the order of e 2 then ||Xp||g r is of order 
e 3 . As we shall see this is exactly a type of smallness required in order to insure the 
success of the KAM algorithm (cf. Theorem [6]). 

Warning In Z" we always use as norm |Z| the L norm 
hand in Z d , and hence in S c , we use the euclidean L 2 norm. 

4.1. The geometric graph Tg. It is important to recall that the term Q(£,,w') comes 
from the sum of two contributions, the term Q(£, x,w), in the new variables and the 
contribution —2 X)fces c C'^(^)l z fe| 2 (coming from the y variables). In its matrix description 
the two terms will give the off diagonal and the diagonal terms respectively. 

The off diagonal terms are described through a simple geometric construction (which 
gives a complicated combinatorics). Given two distinct elements € S construct 

the sphere Sij having the two vectors as opposite points of a diameter and the two 
Hyperplanes, Hij, Hj^, passing through and jj respectively, and perpendicular to the 
line though the two vectors ji, jj- 

From this configuration of spheres and pairs of parallel hyperplanes we deduce a geo- 
metric colored graph, denoted by Tg, with vertices the points in S c and two types of edges, 
which we call black and red. 

• A black edge connects two points p € Hij, q € Hj^, such that the line p, q is 
orthogonal to the two hyperplanes, or in other words q = p + jj — jj. 

• A red edge connects two points p,q € Sij which are opposite points of a diameter 

(p + q = ji + ij)- 




FIGURE 1. the plane Hij and the sphere Sij. The points 
bi, ij, ji form the vertices of a rectangle. Same for the points 
Q 2, jj, b2, ji 
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The condition for two points p, q to be the vertices of an edge is given by algebraic 
equations. Visibly p £ Hij means that (p — i>i,jj — jj) = 0> the corresponding q = 
p + ]j — jj, while p S Si.j is given by (p — jj,p — jj-) = and the corresponding opposite 
point q is given by p + q = + jj. 

We thus have two types of constraints describing when two points are joined by an edge, 
a linear q — p = jj — j j or p + q = ji + j j and a quadratic constraint (p— ji, ji — jj) = 
or (p — jj,p — jj) = 0. Given a connected component A of the graph we can choose one 
vertex x G S c and use the linear constraints in order to write all the equations which 
define A by linear or quadratic equations on x. We keep track of the linear constraints by 
marking the edges by jj — ji for black edges and jj + j; for red ones. 

Now each connected component A has a combinatorial description which encodes the 
information on the edges which connect the vertices of A. We describe the equations 
defining A by a graph with two types of edges (black, red) marked with pairs i,j € 
[1, . . . ,n]. In [TS] we have seen that such graphs have at most 2d vertices hence we have a 
hnite list of combinatorial graphs. In |17) we have strengthened this estimate, shown that 
for a generic choice the vertices are affinely independent and in particular at most d + 1. 
This stronger estimate is necessary for the proof of the second Melnikov condition. 

We denote by A the combinatorial graph associated to A, note that A encodes the 
information on the equations which the vertices of A must solve; so naturally there may 
be many A associated to the same A. 

Example 4.2. 



X-jl + j3 X - ji + j 2 + ji + js 




the equations that x has to satisfy are: 

(x, 32 - 33) = I J 2 1 2 - (j2, J3) (X, 32 ~ 33) = I J 2 1 2 ~ (j2, js) 

M 2 - (x, ji + 32) = -(ji, j'2) M 2 + (x, ji + j 2 ) = -(ji, j 2 ) 

{x, ji - 33) = \ji\ 2 - (32, 33) (x,ji- 32- 33- 3i) = -Ijil 2 + (ji> J'2) + (ji, js) 

-(32,33) + (iui*) - (h, U) - (33,34,) 

In case the graph has no red edges the equations for the vertex x are all linear. This 
implies that the connected components of Ts which correspond to a given combinatorial 
graph with a chosen vertex are all obtained from a single one by translations by vectors 
which are orthogonal to the edges of the graph. 

More precisely, we have a finite list of non isomorphic combinatorial graphs, Ai, i = 
1, ...,t. To each pair A\,v where v is a vertex of Ai we can associate an affine space 
A4;,-u so that, each point p of A^ which is not contained in a smaller dimensional space 
of type A^. w , belongs to a connected component A with associate combinatorial graph 
Ai and corresponding to the vertex v. 

4.2.1. Roots. By convention we also have chosen a preferred vertex, in each component of 
the graph, called the root. Then for the black components, which come in families under 
translation, the roots of the translates are chosen as the translates of this root. 
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Formalizing, we have a map r : S c — > S c with image a chosen set S c ' r of roots. The 
fibers of this map are the connected components of the graph Tg. There are only finitely 
many elements k with o~(k) = — 1, the finitely many corresponding roots are exactly the 
roots of the components with red edges. 

At this point we can explain how to construct the elements L(k) which tell us how to 
go from the root, of the component to which k belongs, to k. 

When we walk from the root r(fc) to k (inside the corresponding connected component) 
we count the parity ±1 of the number of red edges on the path, this we denote by a(k). 

The equations defining A imply that 

(35) 

k + ]T Li{k)2i = a(k)r(k) , \k\ 2 + £ L t (k)\ j t \ 2 = a(k)\r(k)\ 2 , a{k) = 1 + £ L t (k) 



note that the first of the equations ( |35[ ) defines the L(k), which depend only on the 
combinatorial graph. The fact that this definition is well posed even if A is not a tree is 
a consequence of our generiticity conditions. 

Definition 4.3. We call types the elements j — cr(j)r(j), they lie in a finite list which we 
denote by Z. 

The main fact is that 

Proposition 4.4. The Hamiltonian Q(£,x,w') in the new coordinates z' is the sum 
Y] f Qe(£, w') over all edges I of the geometric graph of the following elements 

• Qe{£, w') '■= ^\Z~£i£j( z 'hz'k + z k^'h) if k are joined by a black edge £ marked i,j 

• 2f(£j w') Al«J {z'^z'f. + z' h z' k ) if h, k are joined by a red edge £ marked i,j. 

From the previous remarks there are only finitely many elements of the second type. 



4.4.1. The matrix blocks of Q and ad(Af). According to Proposition 4.4 the graph has 
been constructed in such a way that we can group Q = J2a Qa where the sum runs over 
all blocks A e Tg and, if E(A) denotes the set of edges in A: 

Q A :=J2-^-HkM\ 2 + E Q ^ w ') 

keA leE(A) 

is a quadratic Hamiltonian in the variables w' A — z' k , z' k with k running over the vertices 
of A. The matrix of Qa has a natural block diagonal structure in two conjugated blocks, 
corresponding to two Lagrangian subspaces in the symplectic space generated by the 
variables w' A := z k ,z' k , k £ A appearing in it. We can thus divide w' A into two conjugate 

components w' A = (u',vf) where u' k = {z'Yj}^ then — \Qa has as matrix Ca ® ~Ca- By 
convention in the first block the root r corresponds to z' T and z' r in the second. 

Given two vertices u' h ,u' k h ^ k G A we have that the matrix element c u / >u i of C 'a is 
non zero if and only if h, k are joined by an edge (marked say (i,j)) and then 

( 36 ) c w h ,u' k =^o-{k)^/S~S~, c u' k ,u' k = >■£(*))• 

By definition L{k) depends only on the combinatorial graph A of which A is a realiza- 
tion, therefore the matrix Ca — Ca depends only on the combinatorial block A. 

Remark 4.5. One may choose the root of each combinatorial graph so that any other 
vertex is connected by a path with at most [(d + l)/2] edges. One deduces the estimates 
Ei \Li(k)\ <d+l, \L{k)\ < (d + l)K for all k. 
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As for —^ad(N"), given a frequency v, we have to understand the action, on the block 
A, with basis the elements e 1 ^ "t x i ' Zm™^ 1 with r(m) = — ^\ i^ji, (on its conjugate A it 
is the minus transpose). The action of Q is only through Qa and gives the matrix Ca, we 
need then to understand the elements (w(£), y')+Y2keS c ^k\z' k \ 2 . The term X]/cgS c ^&l z fcl 2 
contributes on the first block the scalar |r(m)| 2 . As of (w(£),y') it also contributes by a 
scalar, this time J^- J^|ji| 2 — Summarizing 

Proposition 4.6. The matrix of —^ad(Af) on the block A is the sum of the matrix Ca 
plus the scalar matrix [i(|r(m)| 2 + J2i ^lji| 2 ) ~ J2i I A- 

4.6.1. The standard form. By the rules of Poisson bracket we have on the real space 
spanned by z,z that {a,b} = —{a,b} — {b,a} is imaginary so {a, b} — — {a, b} — {a,b} 
and 

Definition 4.7. (a, b) := i{a, 6} is a real symmetric form, called the standard form. 

For the variables we have (z^, z^) = 1, (zh,Zh) = — 1, so the form is positive definite 
on the space spanned by the z, negative on the space spanned by the z and of course 
indefinite if we mix the two types of variables. Thus we may say that an element a in the 
real space spanned by z, z is of type z (resp. z)it (a, a) — 1 resp. (a, a) = — 1. Now choose 
any quadratic real Hamiltonian TL = H. We have {TL, {a, b}} = by the Jacobi identity, 
moreover the map x i— > i{"H, x} preserves the real subspace spanned by z, z hence we have 

(37) 

(a, i{H, b}) = i{a, i{H, b}} = {a, {H, b}} = -{{%, a}, b} = i{i{H, a}, b} = (i{H, a}, b). 



Formula (37) tells us that the operator i{"H, — } is symmetric with respect to this form. 

4.7.1. The case of —^{Q,—}. We apply the previous analysis to i{H,—} = — j{<2, — } 
and its block decomposition. When we have red edges each of the two Lagrangian blocks 
contains both variables z and z and by convention we take as first block the one in which 
the variable corresponding to the root is of type z. The standard form (a, b) := i{a, b} is 
indefinite. 

By assumption the operator — ^{Q, — } can be put in normal form by a change of basis 
preserving the form (a, b). Thus the new basis is formed still by elements which we have 
called of types z and z. 

From all these considerations one has: 

Lemma 4.8. For all combinatorial blocks A which do not contain red edges, the matrix 
Ca is self-adjoint for all £ e A £ 2 . If A contains red edges then each vertex k has a sign 
and corresponds to an element u' k = {z'Y k ^ . 

The diagonal matrix of signs £^ = diag(cr(k)) is the matrix of the standard form in 
the basis u' k = {z')^^ and Ca is self-adjoint with respect to the indefinite form defined 
by £4. 

Lemma 4.9 (conservation laws). In the new variables the conserved quantities are: 

i k i k 

i k 

note that all three quadratic Hamiltonians are represented by a scalar matrix on each 
component of w' A . 
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Proof. We substitute the new variables and use the identities ( 35 ) 



□ 



5. NORMAL FORM REDUCTION 

We now want to simplify Af by applying the standard theory of normal forms for 

quadratic Hamiltonians to each Qa- It is convenient to represent a quadratic Hamiltonian 

as a matrix for its Poisson action on an appropriate function space. For each Qa which 

depends only upon a finite number of variables z k we can describe its action on the 

finite dimensional subspace generated by these variables. Then this symplectic space 

decomposes into the direct sum of two Lagrangian subspaces and — | Qa acts with matrix 

Ca on one block and — Ca on the other. As for —^Af the relevant action is on the space 
p v " • ■ 

(38) 



0,1 of functions with basis e iaa ( k ) v - 



c z k which preserve mass and momentum. [^] 



{L,e < J z k \ = iaa(k)[2_^Vi + 1) e y > z k 



{1 



c 4} 



icrcr(fc)(^^j l 



r(fc))e 



From the previous formulas we have thus that this space decomposes again into blocks 
indexed by the connected components of the graph T$ where mass is —1, each component 
is a symplectic space decomposed into a pair of — stable Lagrangian subspaces, the 



corresponding matrix has been described in the previous section in Proposition 4.6 

The main ingredient we need is : 

Theorem 2. [Theorems 1 and 2 of [IT] / i) For all combinatorial blocks A, Ca has 
distinct eigenvalues, namely it is regular semisimple for values of the parameters outside 
a real hypersurface (the discriminant). 

ii) For any pair of distinct blocks A, B the resultant of the characteristic polynomials 
of the two matrices of the action of — ^ad(Af) is non-zero, hence outside this hypersurface 
the eigenvalues of these two blocks are distinct. 

The algebraic hypersurface union of all discriminant varieties for all the combinatorial 
matrices Ca will be denoted by 21 and called discriminant. 

Since our normal form, thought of as operator has possibly also complex eigenvalues 
let us recall the basic normal form of the simplest Hamiltonians. 

Consider H a := i[a(|zi| 2 — \z 2 \ 2 ) + b(z\z 2 + ziz 2 )], setting a = a + ib. On the space 
with basis Z\ , Zi, z\ , z 2 (symplectic form J) the operator ad(H) has matrix. 



-b 
a 








—a 
-b 



with eigenvalues ±a, ±a, a — a + ib. One easily sees that a 4 x 4 real symplectic matrix 
commuting with this matrix, when the 4 eigenvalues ±a, ±a are distinct, has the same 
block form Mp for some complex number f3 = c + id and so it is represented by the 
Hamiltonian Hp = i[c(|zi| 2 — \z2\ 2 ) + d(z\z 2 + ziz 2 )}. 

Now we need to decompose the various combinatorial blocks that we previously de- 
scribed. We have already defined the discriminant hypersurface 21. For each such matrix 
Ca of some order k we have: 



■^we deviate from the notations of | 15| and in F ' 1 we also impose zero mass 
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Proposition 5.1 (cf. Williamson [2I])- On the region A z i \ 21 the eigenvalues of the Ca 
are analytic functions of £ say $i> . . . , $dim(A) ■ 

For all £ € A £ 2\2t f/iere exists a linear symplectic change of coordinates u' —¥ Ua(£)u' — 
u" such that: 

1. t//i(£) is orthogonal with respect to £a 

2. Ua(Q * s analytic in £. 

3. Ua{£.) conjugates Ca into the following normal form: 

For each real eigenvalue , Ca o-cts as dl on the (one dimensional) eigenspace of •d 
in ua- 

For each pair of conjugate complex eigenvalues -d± — a ± ib, we have a real two dimen- 
sional space such that the two complex eigenvectors lie in its complexification. Then we 
have a basis of this subspace such that Ca restricted to this subspace is a 2 x 2 matrix 




The matrix Tia of the standard form on this basis is diag{l,~ 1) so one of the variable is 
a z and the other a z. 

Since Qa depends on parameters so does U. In order to ensure analyticity we remove 
the algebraic hypersurface 21 so that the multiplicity of each eigenvalue is constant. In 
each connected component of A e 2 \ 21 and for each block containing red edges, we have 
that the number of real and complex eigenvalues is constant. 

Given a geometric block A let A be the corresponding combinatorial block. In each 
connected component, the non-unique choice of the matrix U, putting in canonical form 
the matrix C4 determines a symplectic change of variables for all blocks A with combi- 
natorial block A, we do this for all the finitely many A. We may then index the new 
variables still by S° and decompose S c in two sets: an infinite set Sf, which indexes the 
real eigenvalues, namely is an eigenvector of — of real eigenvalue Then a fi- 

nite set which indexes the complex eigenvalues (note that by the special block structure 
of the Hamiltonian there are no purely imaginary eigenvalues). By the reality condition 
each two by two block corresponding to a pair of conjugate eigenvalues is indexed by 
a pair (h,k) of elements of Sf. We write the conjugate eigenvalues as l dh,k,'&h,k with 
fth,k — a-h.k + Note that for (h, k) € Sf we have a(h) — 1 and tr(fc) = — 1. By abuse 

of notation we still call x, y, Zk, Zk the new variables. We have finally the final diagonal 
form of the Hamiltonian 

Theorem 3. i) For each connected component of A e i \ 21 we have a symplectic change 
of variables ^1 ( depending on £,) which puts the Hamiltonian J\f in the canonical diagonal 
form M = K + 2K 1 where 

n 

(39) K 1 = -Y J ^iVi+ Yl °{k)"dk\z k \ 2 + Yl a hA\ z h\ 2 - \z k \ 2 ) + b h , k (z h z k + z h z k ) 

i=i fees= (/i,/c)eS? 

The elements 'dk,ah,k,bh,k depend on £. 

ii) There exists a connected component (hence an open set) of £ such that in this 
region all the eigenvalues are real i.e. Sf is empty f\17\. proposition 1.13). 

Since Ua mixes only the variables u and since L, M, K are scalar on the components u 
then in the new variables we have still 

(40) L = 5> i + 5>(fc)|z fe | 2 , M = ^j^ + 5>(fc)r(fc)|z fe | 2 , 

i k i k 
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K = £ij*i 2 ^+E ff ( fc )i r ( fc )i 2 i^i 2 - 

i k 

Corollary 5.2. In the elliptic open set of item ii) of the previous Theorem the NLS 
Hamiltonian has the form 

(u,y) + Y / V k \z k \ 2 + P, 

k 

where 

Wi = U*| 2 - 2& , n k = a(k)(\r(k)\ 2 + 2$ k ) 
and P := P o $j satisfies the bound \\Xp\\^ r < er + e 5 r _1 

6. THE KERNEL OF ad(Af) 

6.0.1. Non-degenerate quadratic Hamiltonians. Consider a quadratic Hamiltonian 

(41) Q = (u,y) + ^2 a k\ z k\ 2 + ^2 a h ^ k (\z h \ 2 - \z k \ 2 ) + b htk (z h z k + z h z k ) 

keS? {h,k)es? 

we want to study the kernel of ad(Q) on the space of Hamiltonians of degree < 2. We set 
ah,k := a,h,k + ibh,k so that ±an,k, ±oih,k are the four eigenvalues of ad{a h , k (\z h \ 2 - |z fe | 2 ) + 
bh, k (zhz k + Zh,z k )) acting on the space spanned by Zh, z k , Zh, z k . Next the operator i<2 acts 
on the real space spanned by the elements e lav ' x z^, (we need some convergence conditions 
given by its norm). If as in our case Q commutes with the mass and momentum then it 
acts also on the subspace F ' 1 where 

Definition 6.1. We denote by F 0,1 the space of functions spanned by the elements 
e la "' x zZ with zero mass and momentum. I.e. J^. i>i = — 1, VQ\ + k — 0. 

Definition 6.2. We say that Q is non-degenerat^ if the coordinates u>i are linearly 
independent over Q and its eigenvalues for the action on F 0,1 are all non-zero and distinct. 

It is then not difficult to analyze the kernel of iad(Q), i.e. the elements which Poisson 
commute with iad(Q), on the space of Hamiltonians of degree < 2 commuting with mass 
and momentum, we have: 

Proposition 6.3. If Q is non-degenerate then a Hamiltonian of degree < 2 Poisson 
commutes with Q if and only if it is of the form: 

(42) Q' = (tj',y)+ <4N 2 + J2 a'h, k (W\ 2 ~\z k \ 2 ) + b^ k {z h z k +z h z k ) 

keS£ (/i,fc)es, c 

Proof. Degree zero in w: Monomials of degree < 2 ad of degree in w are of the form 
y l e lv x , I = 0, 1 and for the eigenvalues we have: 

(43) {Qye iv - x } = -i(u,v)y i e iv - x 

Since by hypothesis of linear independence over Q these eigenvalues are if and only if 
v = 0. Hence the Kernel of ad(Af) is x independent and hence of the form c + C ■ y. 

Degree one in w. By definition the eigenvalues of the adjoint action of Q on F ' 1 are 
non-zero. 

Degree two in w: The eigenvectors of degree two in w are obtained by multiply- 
ing eigenvectors of degree one in all possible ways. The eigenvalues by summing two 
eigenvalues. 

□ 

in the usual language we should say regular semisimple. 
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6.4. Eigenvalues and eigenvectors. The fact that the Hamiltonian AT, is non-degenerate 
for generic values of £ is essentially a consequence of Theorem [2] we state it as: 

Proposition 6.5. The normal form J\f is non- degenerate for all £ outside countably many 
braic hyper surf aces. 



The hypersurfaces are the discriminant St, and countably many resultants which we 
wish to describe in terms of the eigenvalues of the matrices Ca- We need to interpret the 
fact that the eigenvalues are distinct for the action on F ' 1 into a statement on commuting 
Hamiltonians. 

We would like to choose coordinates, independent of £, which are eigenvectors for for 
M, L, TV so that 

Lemma 6.6. A regular analytic function F Poisson commutes withAf, M, L for generic 
values of £ if and only if each monomial appearing in F Poisson commutes with M, L, K 
and K 1 . 

Proof. An element commutes with TV if and only if it commutes with both homogeneous 
parts of degree 0,1 that is K and K 1 . □ 

The coordinates we have used up to now are eigenvectors for M, L, K. As for K 1 
we need to take a basis of eigenvectors, this has been done for h € S£. For each pair 
(h, k) £ Sf indexing a pair of complex conjugate eigenvalues we have chosen the symplectic 
coordinates so that the eigenvectors are z£ ±iz^ a . We may take ±izj^ a as coordinates 
replacing z^,z t 7 <T , note however that their are not real symplectic. With this choice the 
previous Lemma holds. 

A monomial m = e^ k ' x ^y l z a zP has momentum i7r r (m) with 
(44) 

7r r (m) := 7r r (fc,a,/3) = 7r(fc)+ ^ j8j->0') r (i) = n(k, a, /?)+ ^ (a,— ^)(a(j)T(j)-j) 

and it satisfies momentum conservation if ir r (k,a, /3) = 0. Note that given functions /, g 
which are eingenvectors of momentum we have 7r r ({/, g}) = n z (f) + n T (g). 

Let us look at the specific cases of degree < 2 divided in three subcases according to 
the degree in w. 



Degree zero in w: In our case oji = |j,| 2 — 2£i, hence the eigenvalues in (43) can be 
zero (as functions of £) only if v = 0. 

Degree one in w: Given k € S£ (corresponding to the eigenvalue dk of CyC) the 
monomial ^"(tyu-x z a j s an eigenvector for all our operators with eigenvalues: 

{L, e iaa{k)v - x z a k } = iaa(k)(J2 Ui + ^ j aaW '"*4, 

i 

{M, e ia<T ^ v ' x z%} = io-cr(k)C}2^j t + r^y^^Zk 

i 

{K, e^'W^zft = iaa(k)(J2 ^ + |r(fc)| 2 ) e^ k ^z^ 

i 

{K\ e w W^4} = w(fc)(- ]T + #k) e iffffW "4, 

i 

Now the last term is an eigenvalue of — £ • vl + C_4 and we claim that the determinant of 
this matrix is non-zero as a function in £. 
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To prove our claim we set the = 1. After this substitution, using Formula (36 1, we 
notice that the entries of C_x are zero modulo two. Using that ^\ = — 1 we have that 
— £ • vl + CjH at £i = 1 reduces modulo two to the identity matrix. 

In the case of (h,k) € 5f (corresponding to the eigenvalues $h,k,$h.k of CU) the 
eigenvectors are e lr7l '' x (z^ ± iz;7 <J ), recall that a{h) = 1. The commutation rules are the 
same as in the real case and one can conclude in the same way as for h G Sy. We have 
proved that for all f outside a countable set of hypersurfaces the kernel of ad(JV s ) is zero 
on functions of degree one commuting with L, M. 

Degree two in w. The eigenvectors of degree two in w are obtained by multiplying 
eigenvectors of degree one in all possible ways. If h, k G S£ we consider the monomial 

(45) ra = e iro W"z k 'zf. 
If aa(h) = —tj'a{k) its eigenvalues are given by: 

{L,m} = iaa{h) u t m, {M,m} = iaa(h)(y] + r(h) - r(fc))m 

i i 

{K,m} = iaa(h)(J2 ^ + |r(M| 2 - |r(fc)| 2 )m, 

i 

{K\m} =iaa(h)(-J2viti + #h-#k)m, 

i 

If aa(h) — a' <j(k) its eigenvalues are given by: 

{L, m} = iaa(h)(^2 Vi + 2) m, {M, m} = icra(h)(^2 "iji + r(/i) + r(fc))m 

i i 

{K,m} = iaa(h)(J2 ^|j*| 2 + |r(/i)| 2 + |r(fc)| 2 )m, 

i 

{K\ m} = iaa(h)(- £ v& +# h + i? fe ) m. 

i 

Similarly if we have (ftx, &i) and (/12, A^) in Sf the corresponding eigenvectors are 

(46) m = e™-* « + itnz^)(< + i<7 2 ^'), 

where <7i , 02 , cr, a' = ± 1 . 

The eigenvalues are given by the Formulas: 

{L, m} = icr(^2 v i + 1 + ^'o") m > { M ; m l = icr (X! ^j' + r ( /l1 ) + o''' 71 '^))™ 

z i 

{K,m} = i<r(^ ^|j,| 2 + |r(M| 2 + ^|r(^ 2 )| 2 ) m, 

i 

{K\m} = ia[- ^ vfa + a hlM +o'aa h2M + i{(Jib hlM + a 2 a' ab h2ik2 )] m, 

i 

finally the statement that on F ' 1 the operator ad(Af) acts with distinct eigenvalues (as 
functions of £) is equivalent to state that the analytic functions that we have exhibited as 
eigenvalues are non-zero except for the terms predicted by Proposition |6.3[ hence being 
analytic they are each non-zero outside a real hypersurface. Since we have a countable 
number of such functions the statement of Proposition |6.5| is proved. 
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Notice that quadratic expressions in w with indexes k 6 and (hi,ki) E Sf do not 
contribute to the kernel of ad(J\f) since the sum or difference of a real and a complex 
eigenvalue cannot be equaled to — £ • v. 

Part 2. Quasi Toplitz functions 

7. Optimal presentations cuts and Good Points 

Let us recall the definition of N -optimal presentations, cuts, good points as given in 
[22] . we omit most proofs. 

7.1. TV-optimal presentations. An affine space A of codimension I in M. d can be defined 
by a list of t equations A :— {x \ (vt,x) — p{\ where the Vi are independent row vectors 
in K d . We will write shortly that A ~ [vi;pi]g. We will be interested in particular in 
the case when Vi,Pi have integer coordinates, i.e. are integer vector^ and the vectors Vi 
lie in a prescribed ball B^ of radius some constant kN . Recall that. Q, we have set 
k := maxi |j^|. We denote by 

{vi)t = Span(ui, . . . , v e ; R) n 7L d , B N := {x £ 1 d \ {0} | |i| < kN}, 

here N is any large number. In particular we implicitly assume that Bjq contains a basis 
of R d . 

For given s G N, in the set of vectors Z s we can define the sign lexicographical order -< 
as in .22] as follows. 

Definition 7.2. Given a — (oi, . . . ,a s ) set (|o|) := (|ai|, • ■ ■ , |ffl a |) then we set a -< & if 
either (|a|) < (|6|) in the lexicographical order (over N) or if (\a\) = (\b\) and a > b in the 
lexicographical order in Z. 

With this definition every non empty set of elements in Z s has a unique minimum. 
Notice that, by convention, among the finite number of vectors with a given prescribed 
value of (\a\) we have chosen as minimum the one with non negative coordinates. 
In particular consider a fixed but large enough N. 

Definition 7.3. We set Hn to be the set of all affine spaces A which can be presented 
as A = [Vi]Pi]i for some < I < d so that that u 4 £ B N , pi e N. 

We denote the subset of Hat formed by the subspaces of codimension I by W e N . 

We display as (pi, . . . ,pf, V\, . . . , v/) a given presentation, so that it is a vector in Z^ d+1 \ 
Then we can say that [vi;pi]t ■< [w^q^t if {pi, . . . ,Pf,Vi, ...,vi)-< {q%, ...,qt,vi,., .,v e ). 

Definition 7.4. The ./V-optimal presentation [U]qi]i of A € H e N is the minimum, in the 
sign lexicographical order, of the presentations of A which satisfy the previous bounds. 

Given an affine subspace A := {x x) = Pi , i = 1, •■■,£} by the notation A^f[vi]pi]i 
we mean that the subspace has codimension i and the given presentation is ./V-optimal. 

Remark 7.5. i) Note that each point m — (mi, . . . , m^) € Z d has a iV-optimal presentation 
(this presentation is usually not the naive one [e^m^ where the form the standard 
basis of Z d ). 

ii) Thus we may use the ordering given by TV-optimal presentations of points in order 
to define a new lexicographic order on % d which we shall denote by a -< n b or a -< b when 
N is understood. 

^such a subspace is usually called rational. 
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Remark 7.6. At this point we extend the definition of -< to the elements of Hat by using 
their A-optimal presentation. 

Lemma 7.7. i) If the presentation A — [Vi;Pi]e is N -optimal, we have 

(47) Q < pi < p 2 < . . . < n- 

ii) For all j < £ and for all v € Bpj \ (vi, . . . , Vj), one has: 

(48) |(«,r)|>p i+1 , We A. 

Hi) Given j < £ set Aj := {x \ (t>i, x) — Pi, i < j}, then the presentation Aj = [vf,pi]j 
is N -optimal. 

iv) Finally —A has a N -optimal presentation —A = [v'^Pi] with the same constants Pi 
and (K|) = 

Remark 7.8. For fixed N, £, p the number of affine spaces in "Hn of codimension £ and 
such that pe < p is bounded by (2kN + l) ed (p + l) e . 

7.9. A decomposition of Z d . From now on we assume that a generic set S of tangential 
sites has been chosen, this choice determines the graph Tg. We also assume to have chosen 
the roots satisfying the conditions of [4.2.1 The case S = is possible and it gives the 
identity map as the root map. 

We shall need several auxiliary parameters in the course of our proof. We start by 
fixing some numbers 

(49) r > max(d 2 + n, 12), n := (4d) d+1 (r + 1) , 

c<i,C>4,A = (d+ l)!K d Cc _1 . 

In what follows N will always denote some large number, in particular N > N a . 

Using the fixed parameters c, C and the notion of optimal presentation, for each N > N 
we want to construct a decomposition 

(50) Z d = Llf =0 Ai{N) 

of 7L d which will be crucial for the estimates of small denominators (cf. \ 10.6.1 ) and given 
by the following 

Definition 7.10. i) A subspace A^[vi]pi]i € H e N with 1 < £ < d is called N-good if 

.ii An 1/ ^ H no flnnA+Arl Kit- 1. 

, + lr\ r\ Ci/~l4-- 



< cN id . The set of A^-good subspaces of codimension £ < d is denoted by Hj 9 
ii) Given A € "H^V 9 the set: 



(51) 

A 9 := {x e AnZ d \ \r(x)\ > CN Tl , |(»,a;)| > Cmax(A 4dT °, c~ id pf ), Vu G B N \ {v t ) t } 
will be called the N—good portion of the subspace A. 

iii) For 1 < t < d — 1 the set Ai(N) is defined by setting: 

(52) A,(A0:= |J A 9 = [j {v z ; Pl ]f. 

AeH , g 

A point m G Z d is called good if it lies in one of the set At(N), 1 < £ < d — 1. 
The remaining points may be distributed in two sets: 

A d = A d (N) C {m e Z d : |r(m)| < CA Tl } 
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and 

(53) A a = A Q (N)c{meZ d :rrA[v l ;p l ] with Pl > CK idT °}. 

Remark 7.11. Notice that every v S Bn \ (vi)e gives a non constant linear function (v, x) 
on A. Thus the good points of A form a non empty open set complement of a finite union 
of strips around subspaces of codimension 1 in A. Note moreover that we are interested 
only in integral points and the integral points in A which are not good form a finite union 
of affine subspaces of codimension one in A. 




FIGURE 2. A drawing of the standard decomposition in 
Aq is Z| minus the dashed lines (each dashed line is described by 
an equation On each dashed line the set is signed 

in solid boldface. Note that [u;p]f is [n;p]i PI minus a finite 
number of subspaces of codimension two, i.e. points. 



The fact that Formula ( 50 1 provides a decomposition of Z d depends upon the following 



Proposition which is a variation of [22] Proposition 1). 

Proposition 7.12. Each point m^[vi,pi\ with |r(m)| > CN Tl and pi < CN Mr ° belongs 
to the set [vi]Pif t for some choice < £ < d. 

Proof. Consider Ai :— [vi;pi], since CN idT ° < cN^ W e see that it is TV-good, so if 
m € A\ we are done, otherwise we have that P2 < C max(iV 4<iro , c~ M pf d ) < ciV^i. 
Consider A 2 :— [vi, V2]P\,p-z\, we see again that it is iV-good, so if m € A 9 2 we are 
done otherwise we have that p% < Cmax(A r4dr °, c~~ 4d p\ d ) < cN 1 ^ we continue in this 
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way and either we show that m £ A\, i < d or we have a sequence of inequalities 
Pj < Cnmx(N 4dT °, c -4d p^l 1 ). Let fc > 1 be the maximum index such p k < CN 4dT ° , if 
k < d we have for all Pj ,j = k + l,...,d that Pj < ( c -4d c y-k N (4d)t- h + 1 T _ We then 
compute the coordinates of m by Cramer's rule and by just estimating the numerator we 
have a sum of d\ terms each of which can be bounded by N d ~ x N^ ^(c^C) 4 - 1 . This 
sum is then bounded by jv5rf(d-i)+(4d) d r 0j it follows that | m | < ^ N 5d(d-i)+(id) d r and 

r(m) < ^/dN 5 ^- 1 ^ 4 ^"^ + dn. Recall r x := (4d) d+1 (r + 1) thus we easily see that 
|r(m)| < cN T1 . □ 

7.13. Cuts. In the previous paragraph a point m which is a good point has an associated 
afhne space A of some codimension £ < d which, as we have seen in the proof of Proposition 



7.12 corresponds to a jump, or as we shall say a cut, in the optimal presentation. In fact 
for technical reasons, having to do with the structure of Poisson bracket, we need to refine 
this notion, introducing auxiliary parameters /i, 9 which give a better control on the cut 
and allow some flexibility in the constructions. This is the topic of this paragraph. 

We assume that N has been fixed. Given a point m we write m— >■[?;.,- for its optimal 
presentation dropping the index £ which for a point equals d, we implicitly also mean that 
m G 7L d . Set by convention p = and Pd+i = oo. 

We then make a definition involving three parameters: 

Definition 7.14. The parameters N,9,/i,t are called allowable if 

tq < t < n/(4d), c<6»,^<C, N > N Q . 

We need to analyze certain cuts, for the values p. L associated to an optimal presentation 
of a point. This will be an index £ where the values of the Pi jump according to the 
following: 

Definition 7.15. The point m^>[vi;pi] has a cut at I G {0, 1, . . . , d} with the parameters 
p = (N, 9, fi, t), if £ is such that p e < fiN T , p £+1 > 9N 4dT . 

The space A := {x \ (vi,x) = Pi, i = 1, ...,£} has [vf,pi]e as optimal presentation and 
it is called the affine space associated to the cut of m. 

Consider a subspace ^4 £ Hjv of codimension £ such that in its optimal presentation 
pe < fiN T . The set of points m £ A which have £ as a cut with the parameters N, 9, fi, r 
have A as associated affine space, if furthermore |r(m)| > 9N Tl we call them (N,9, fi,r)- 
good points of A, we write m € A 9 ^ N s r y 

By definition the other affine spaces have no good points with respect to these param- 
eters. 

Notice that 9N 4dr > fiN T (since M 4 ^ 1 ^ > N^ 4d -^ T< > > N > Cc" 1 > so for 

any given m G S c there is at most one choice of £ such that m has a £ cut with parameters 

9,fi,T. 

Remark 7.16. 1) The purpose of defining a cut £ is to separate the numbers pi into small 
and large. The parameters N, 9, /i, t give a quantitative meaning to this statement. 

2) The set of good points A, N s if non-empty, is the complement in A of a finite 
number of codimension one subspaces. 

3) Given any rational affine subspace A (i.e. defined by equations over Z) there is an 
N so that VJV > JV we have A £ Hn, its optimal presentation is independent of N, the 
set A9 (N,e,ti,r) is non-empty. 
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The connection between the notion of good points A 9 , defined in (51 1, of a given 



subspace A and the notion just introduced is explained by the following Lemma. 

Lemma 7.17. Given p < cN Tl ^ 4d \ set T ( p ) so ^ N r( P ) = max (jV" r ", c _1 p). Then for 
all c < 9,/i < C and for all affine subspaces [vi\pi]g € H^v such that pi = p, we have 
that every point m € [v^pi] 9 , is a (N, 9, /i,r(p)) -good point for [vi\pi\i. I.e. [vi]pi] 9 e C 
(k;Pi]^)(iV,e,M,T(p f )) /° r allc<6,^< C. 

Remark 7.18. 1) If m^-^jpj] has a cut at £ for the parameters 9' , //, r then it has also 
a cut at £ for parameters 0, /i, r with 9 < 9' , fjf < /x provided 0, /i, r are allowable. 

2) If for a given £, r we have pe < c7V r , > CN 4dT , then ^ is a cut with parameters 
9, /i, r for every choice of allowable 0, \x. 

Lemma 7.19. [Neighborhood property] Consider m,r G Z d ura£/i m^>[vi]pi\, r—t[iVi; qi] 
suppose that m has a cut at £ for with the parameters N, 9' , fjf , r, and suppose there exist 
allowable parameters 9 < 9' , fjf < fx: 

(54) |r - m| < kT 1 ^ - fx')^- 1 , k" 1 ^' - ^iV 4 ^ 1 . 



en: 



flj TTie point r has a cut at £ for all allowable parameters 9,/i,t for which (54) holds. 



(2) (w 1 ,...,w e ) = (t>i,..., v t ). 

(3) [wi] qi]e is the N -optimal presentation of [vi;pi]g + r — m. 

Corollary 7.20. Consider m—>[vi;pi], r^[wi;qt] such that 

(55) \r - m\ < fiT 1 ^ 4 "^" 1 - cCT 1 ^- 1 ) 

and both m, r have a cut with parameters p = (N, 9, /i, r) then we can deduce: 

i) the vectors m,r have the cut at the same £; 

ii) the space B associated to the cut of r is the one parallel to A = [vi]pi]f and passing 
through r namely 

(56) B = A + r -m. 

Remark 7.21. With the above lemma we are stating that if m has a £ cut with parameters 
d',fx',T then, for all choices of 9 < 9' , fjf < /i, for which 9,/x are allowable parameters, 
there exists a spherical neighborhood B of m such that all points r £ B have a £ cut with 



parameters 9,/x,t. The radius of B is determined by Formula (54 1. Note moreover that 



if r has a cut at I for some parameters then so has — r and with the same parameters. 



Then lemma 7.19 holds verbatim if in formula (54) we substitute \m — r\ with |m + r\. 

We finally combine [7T71 and [7~20| 

Lemma 7.22. Set r(p) such that N ri - P ' > — max(iV T °, c~ 1 p). For all affine subspaces 
[vi',Pi]e with pi < c7V Tl /( 4d ) the following holds. For all m £ Z d with m £ [v^Pi]^, for all 
r £ % d and for all parameters c < 9, fi < C such that 

(57) \r - m\ < kT^/i - cJJV 70-1 ^ -1 ^ - ^JV 4 * 0-1 , 

r, m /lave i/ie same ciz£ £ iraf/l parameters 9,fX,T(pe) with parallel corresponding affine 
spaces. 

The definitions which we have given are sufficient to define and analyze the quasi- 
Toplitz functions, which are introduced in section [8j 
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7.23. Graphs and cuts. Recall that the choice of the vectors S := determines a 
colored marked graph Tg with vertices in the set S c . 

This graph has finitely many components containing red edgesj^] The remaining set 
will be denoted by S c and it is a union of connected components each combinatorially 
isomorphic to a combinatorial graph out of a finite list Q := {Ti, . . . ,Tn} formed only of 
black edges. It will be enough to concentrate our analysis only on these black graphs. 

Definition 7.24. Given r £ Q we set Ay to be the union of all connected components of 
F 's isomorphic to T. 

The set Q is partially ordered by setting I\ < Tj if r, is isomorphic as marked graph 
to a subgraph of Tj . 

From the theory developed it follows that, if T £ Q has dr + 1 vertices we have that Ar 
is a union of translates of any of its components (of the graph Tg). Moreover ^4r is the 
portion of S° in a union of dr + 1 parallel affine subspaces of codimension dr minus a union 
of finitely many affine subspaces of higher codimension whose points lie in U r >r Ar 4 . 

Let us recall how we arrived at this statement. We choose a root in each T. Using the 
root a geometric realization of T is an isomorphic graph, with vertices in S c , in which 
the image r of the root solves a certain set of dr independent linear equations (Formula 
(61) of [TS]) and the other vertices are determined by the labels on the graph T. The 
components A of Tg which are isomorphic to T are exactly those geometric realizations of 
r which are not properly contained in a larger component. Recall that we have imposed 
generic conditions so that the vertices in a component are affinely independent. Therefore, 
a component A associated to T, spans an affine subspace (A) of dimension exactly dr- All 
other geometric realizations of T are obtained from a given A by translating the graph A 
with the integral vectors orthogonal to (A). This set is thus a union of dr + 1 parallel 
affine subspaces of codimension dr passing each through an element m of A, image of a 
point a £ r. It may well happen that a translate of r may solve also the linear equations 
defining a larger graph, the points in the corresponding component lie thus in a stratum 

Ar^ r 4 >r. 



Example 7.25. 




3.2 



equations (x, ji - j 3 ) = |ji| 2 - (j2, J3) 

0, J2 - js) = I J 2 1 2 - (j2, J3) 



Notice that this is a subgraph of the graph of example 4.2 

If a £ r we let Ar : a be the subset of Ar formed by the corresponding elements. 

The set Ar, a spans an affine space Ar, a of codimension dr and, in fact, is the comple- 
ment in this affine space of the points which belong to graphs which contain strictly F. 
Thus Ar.a is obtained from the affine space Ar. a removing a finite union of proper affine 
subspaces. 

Thus for any T £ Q having chosen a root r we have the stratum Ar r which is the 
complement in an affine space of a finite union of codimension 1 subspaces. Given any 
point m £ S c it lies in a unique stratum Ar. a which is parallel to Ar yI we thus have for 
m a corresponding root r(m) £ Ar, r which is the intersection of Ar^ with the connected 

"A rough estimate of a bound on the norm of these points is (2d + 3)re. 
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component of the graph Ts in which m lies. Thus m — r(m) depends only upon the 
stratum Ar a . 

Definition 7.26. We denote the vector m — r(m) as the type of m. 

Remark 7.27. The possible types run on a finite set Z of vectors each a linear combination 
of the elements ji with coefficients whose absolute values add to at most d, hence each 
u 6 Z has \u\ < dn. 

Remark 7.28. Among the combinatorial graphs we have the graph {0} formed by a single 
vertex. The corresponding open stratum ^4{o} .o obviously spans Z d , it is formed of all the 
points in S c which do not belong to any of the proper strata Ar- 

We have thus finitely many affine subspaces A-p, a associated to the pairs (T,a), these 
subspaces can be presented using the linear equations associated to the geometric realiza- 
tion by formulas (61) of [15] . We have a finite number of possible systems of equations 
with coefficients depending linearly or quadratically from the set S. We verify that all the 
constant coefficients of these equations are < c/Vo (in fact the coefficients can be bound 
by (2dn) 2 ). Thus by the bounds chosen each of these subspace lies in Hn, VA > Ao and 
thus has an A -optimal presentation [wi]qi]d r , furthermore each q^ < cNq. 

Lemma 7.29. Assume that to € Ar. a has a cut at £ with parameters 6, fi, r and associated 
space [vi]Pi]i. Then [vi;pi]i is contained in the affine space Ar.a — qi]d T ■ In particular 
I •(!■. . 

Proof. Since to is in both spaces it is enough to prove that (wi)d r C («*),«. By contradiction 
if some Wj £ (vj)t then we have that \(wj,m)\ > cN 4dT °. This is incompatible with the 
estimates on the q^. □ 

Theorem 4. If m,n have the same cut £ and the same associated affine space [vi;pi]i 
then they belong to the same stratum A^^ a o- n d hence have the same type, i.e. m — r(m) — 
n — r(n). 

Proof. If both to, n form an isolated component then they belong to the open stratum. 
Assume that to € A^, a with dp > 0. Thus to satisfies the equations [wi;qi]d r - By the 
previous Lemma since n £ [vi]Pi]e we know that n e [i*^; <7i]d r ■ This implies n £ Ar> 
where T' contains T so that dr' > dr- 

Exchanging the roles of to, r we see that to G A-p" where T" contains T' . This implies 
that r = r'. Since the equations [wi;qi]d r define the affine space spanned by Ar, a the 
claim follows. □ 



8. Functions 

8.0.1. Toplitz approximation. Given a parameter N > N we will call low momentum 
variables relative to N, denoted by w L , the zj such that |r(j)| < CA^ 3 . Similarly we call 
high momentum variables, denoted by w H , the zj such that |r(j)| > cN Tl . Notice that 
by ( 49 ) the low and high variables are separated. The remaining variables will be denoted 
by w H . Given any set X of conjugate variables by {•, -} x we mean the Poisson bracket 
performed only respect to the variables X (keeping the other variables as parameters). 

Our definitions will depend on several parameters which in turn depend upon the 
particular problem treated. We shall denote them by a compact symbol p. 
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Recall first that a monomial e 1 ^ k ' x ^y % z a z^ has momentum 

n 

i(E fc ^+ E ff C7>(j)(«i - A))- 

i=l iGS- 

Thus we make a: 

Definition 8.1. (Low-momentum) A monomial e ltyk ' x ^y % z a z^ is (iV,/x)-low momentum 
id 

(58) £ |r(j)|(«j +j8j)< A^ 3 , |*|<JV. 
We denote by 

C s ,r(N,n) CM ST 

the subspace of functions 

(59) g = E 9k,i, a , f i^ k ' x) V i * x z f) G H s , r 

whose monomials are (TV, /x)-low momentum. The corresponding projection 

(60) n^:-W s , r ^-£ S!r (iV,M) 

is defined as Ilj^ := 11/ where / is the subset of indexes (k,a,(3) satisfying (58 1 (notice 
that the exponent i of y plays no role). Finally, given h G Z d , we denote by 

£ s , r (N,n,h) c£ 8 , r (N,n) 



the subspace of functions of momentum — ih, i.e. whose monomials satisfy (cf. (44 1): 

(61) n T (k,a,(3) + h = 0. 

By (58 1, any function in C s . r (N, fi), c < /i < C, only depends on x,y,w L and therefore 

(62) g,g' G C s , r (N,fi) ==* {<?, </}^, { 5 , ff '} L do not depend on W H . 
Moreover if 

(63) \h\>fiN 3 + K N C Sir (N,fi,h) = 0. 

Definition 8.2. Given iV and allowable parameters p = (AT, /i, r) we say that a mono- 
mial 

m — mi ; a i — e i ( k > x *>v l z a z l3 z cr z a ' 

is p = (A, /i, r)-bilinear if: 

1) it satisfies momentum conservation, 7r r (m) = 0, ( |44| and: 

(64) \k\<N, |r(n)|,|r(m)| > 0N^ , E l r 0')l(«j + AO < ^ 3 ■ 

3 

2) There is an < £ < d so that both m, n have an £ cut with parameters N, 9, fx, r. 
To the high variables to, n of the monomial m we associate the two affine subspaces 

A, B associated to their £-cut. By reordering the variables if necessary, we may assume 
that A -< B and associate to the monomial, or equivalently to the pair to, n, only A. 

A monomial which is p bilinear with associated affine space A is also called A,p re- 
stricted. 

We shall often write to G p — cut to mean that there is an < £ < d so that to has an 
I cut with parameters p = (N, 9, fi, r). 

For k S Z n , by |fc| we always mean the L 1 norm \ki\. 
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Remark 8.3. Note that by momentum conservation and Remark |7.27| 

0= |7r r (ro)| = \aT(m)+a'T{n)+^2a(j)T(j)(a j -/3 j )+ir(k)\ > \am+a'n\-(2dK,+uN 3 +K,N) 

3 

and we deduce 

(65) \am + a'n\ < uN 3 +3dnN. 

Remark 8.4. Note that under condition 1), in condition 2) it is sufficient to assume that 
m, n have a cut with the same parameters TV, 9, u, r. The fact that the cut is at the same 
£ follows from Corollary 7.20 and (65). 

By Theorem |4j for all A,p restricted monomials m with given am + a'n, a, a' we may 
deduce r(m) — m from A. By Corollary |7. 20 and (65), we deduce 

(66) -aa'A + a'(am + a'n) = B. 

Note that, by hypothesis, m G A^, n G B^. B in turn fixes r(n) — n and hence the type 
of the monomial it(m), defined as 

(67) u(m) := u(A, am + a'n, a, a') = a(r(m) — m) + a'(r(n) — n), 
depends only on the elements A, am + a'n, a, a' . 

Definition 8.5. Setp= (s,r,N,0, fi,r). In % s>r we consider the subspace B p of (TV, 9, fi, r)- 
bilinear functions, that is whose monomials are all (N, 9, n, r)-bilinear. We call 

rip := II(jv,9,/j,t) : Ms.r — > B p 
the projection onto this subspace. A function / G B p is of the form: 

(68) f(x,y,z,z)= f™ a n( x ,y, wL )z™ z n 

0,0'=± |r(m)|,|r(„)|>0iV T l , 

with fZi£(x,y,w L ) G C s , r (N,n,ar(m) + a'r(n)). 

By convention we assume /^'n (a;, y, w L ) — f% £ (x, y, w L ). 

Note that to each element f^i' T n (x,y,w)z^ l z^ is associated an affine subspace A by 
Definition El 



Remark 8.6. Of course, if we take new parameters 9',u' with 9 < 9' , fjf < fi we have 
that the set of (N, 9, fi, r)-bilinear monomials contains the set of (TV, 9', a', r)-bilinear 
monomials. In particular we have, for each s, r: 

H(N,8' ,t)H-(N,8,/j.,t) — H(N,8,iJ,,T)R(N,0',fi',T) = R(N,6' ,fj,' ,r) ■ 

We now fix p, an affine subspace A and two signs <r, a' . 

Definition 8.7. The space Tj[' p of A, p-restricted Toplitz bilinear functions of signature 
a, a' is formed by the functions g G B p where the coefficients of Formula 68 satisfy: 

(69) g = g(<™ + ^)«' - 

m,n 

where the apex with p — (TV, 6*, /i, r), means that the sum is on the A, p-restricted 

monomials (cf. 8.2), with bilinear part z^z" . 
For all h — am + a'n we have: 

(70) g(h) G £ Sjr (/V, u,h + u{A, h, a, a')) 
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where u(A, h, a, a') is the type, see Formula (67). 

Remark 8.8. i) Notice that we have a translation invariance property (which justifies the 
name restricted Toplitz). Indeed given A, a, a',h one can choose arbitrarily an element 
g cr ' cr (A, h) satisfying (70), and use formula (69) to define a function in B p . One easily 
sees that indeed such an expression defines a function in T-L s ,r- 

ii) Note that condition (70) implies that g(sm + a'n) in (69) has momentum i(err(m) + 
a'r(n)) hence g has zero momentum (as required). 

Finally we define 

Definition 8.9. The space T p of piecewise Toplitz bilinear functions 

(71) 9 = £ 9^' (A), g^'(A)eT° : ;'. 

AeU N ,(t,ct'=±1 

By definition T p C B p is a subspace of the (N, 9, jj,, r) bilinear functions. 

Lemma 8.10. Consider f,g G T p and q G C Str (N, fj,i, 0), c < fj,, [ii < C. Given any 
p[ = (s', /, N, 9', /J,', t) with s/2 < s' < s , r/2 < r' < r, 9' > 9, fj/ < \i, one has 

(72) iw^, r {/, q } L , liv,,-,- -{/. q r y G V ■ 

7/ moreover 

(73) K^iV 3 + 3d«JV) < (<?' - 6»)iV T ° , - //) AT"" 1 

(74) n^^^efp-. 



Proof. Write / G 7p C Z? p as in ( 68 1 where 

(75) f^L = r' a '{A,am+o-'n) G £ s , r (JV, p, am+a'n+u) = C s>r (N, fi,ar(m)+cr'r(n)) , 



similarly for g (recall that u — u(A, am + a'n, a, a')), with f%l a n = fnm- 

Proof of ( [72| . Since the variables z^, z° , |r(m)|, |r(n)| > 9N Tl , are high momentum, 

{r °\A,am + a'n)z° m zi , q} L = {f 7 ' a ' (A, am + a'n), q} L 

The function {/ <T ' <T (A, am + a'n) , q} L in in T-L s ' y by (21 1 and does not depend on w H 
by (62). Hence the coefficient of zj,zj in IIjv,e',/j'{/, ?} L is, 

n^,{f' ff ' (A, am + a'n) , q} L G £ s >, r / (iV, //, (rr(m) + a'r(n)) 

since 7r r (g) = 0, 7r r (/ cr '' T (A, am+a'n)) = —ar(m)—a'r(n). The proof that IT N,e',u'{fi q} X:V G 
T s ', r '(N,9' , fi') is analogous. 

Proof of (74). A direct computation gives 

U,g} H = 



(T.<7 '— ± 



|rCm)|,lr(rt)|>fliVf , 
Tn.,nep — cut 



with 
(76) 



m o,a' o" \ ^ ( —<7\,<J l i £<J ,CTl —<7l,<7 



of course Z gives a contribution only if suitably restricted by the bilinearity constraint. 
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By (62) the coefficient p%f n does not depend on wr- Therefore 
(77) n N ,e',„'{f,g} H = V qZfl.zZ-.zZ with 



E 

|r(m)|,|r(ra)|>0'AT T i,cr, ( 



q a >° •= n 



L cr,cr' 
N ,/V r , m,n " 



It results qZf n € £ s >, r '(N, //, crr(ro) + tx'r(n)) by (77), (76), and momentum conservation. 
It remains to prove the (N, 9', //)-T6plitz property: 

(78) qZl a n = Q a,a ' (A + v' n ) for somc Q a ' a ' (A h) e C S:r (N, fl',h + u)), 



where A is the affine space associated to the pair m, n (cf. 8.2 2)). 

Let us consider in (76)-(77) the term with m,n fixed and a = +l,c' = — 1,<ti = +1 
(the other cases are analogous) 

(79) n 



I 



Since by hypothesis /, g G T v we have that the I that give a contribution have a p-cut 
at I and |r(Z)| > 8N Tl . By Remark 8.4 the affine space associated to the cut of I is 
A' = —A + I + m; (while the affine space associated to the cut of n is B = A + n — m). 
We may assume without loss of generality that A -< B. 

We then divide ( 79 ) in 3 parts according to the relative position of A' in the -< order with 
respect to A < B. Note that all these constraints depend only upon A, m — n,j := I + m. 
We treat the case A< A' -< B, the other 2 cases are similar. 

Since f,gEl~ p we have 



(80) 
(81) 



f$$ = f + ' + {A, m + l)e C., r (N, r(m) + r(I)) 
g~<- = g-'-(A', -l-n)e C s , r (N,», -r(l) - r(n)) , 



for all m, n, I which satisfy the bilinearity constraints with parameters p. 

By construction m, n satisfy the bilinearity constraints with the more restrictive pa- 
rameters p' . Set j := m + 1, by formula (65) we have that the elements j, which come 
from the elements I which contribute, have \j\ < /iN 3 + 3dnN. On the other hand by 
condition ( 73 1 we see that these j satisfy ( 55 ) that is: 

(82) \j\ < K -\fx - ^)n t -\ K~ l {e' - e)N UT ~ x . 



thus, by Lemma 7.19 a j satisfying (82) implies that to, n, £ satisfy the bilinearity con- 
straints with parameters p. 
Then 



ll N,ij.' 



E /« 

A-<A'-<B 



J 9u 



loiii 



E 

j'GZ d :| j | <fj.N3+3d.KN 
A^A+j^A+n-m 



f ++ (A,j)g (-A + j,m- 



depends only on A and m — n, i.e. (78 1 . 



□ 



8.11. Quasi Toplitz functions. Given / G H s r and T £ Tp, we define 

(83) / = f{T) := N 4dT (U {NM f-T), 

and for If > N set 



(84) 



Kft,ti 



sup 

N.t:N>K 



ynf (max(||X^| 



,r))] 
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Remark 8.12. If we take new parameters K',8',p' with K < K',0 < 6', fjf < /z we have 
by Remark |8.6| that 

\\x f \C' e '^' < \\x f \W. 

Definition 8.13. We say that / G TL s . r is quasi- Toplitz of parameters (K,9,p) if 
II^O 7 llsV'^ < 00 an( i we ca U this number the quasi-Toplitz norm of /. 

Since in our algorithm we deal with functions which depend in a Lipschitz way on some 
parameters (eO(a compact set) we take finally a norm which includes also the weighted 
Lipschitz norm (cf. (17)) with A a positive number.: 

Definition 8.14. We set p = (s, r, K, 0, p, A, O). We define 

(85) ll*/ll£:= maxdl^ll^MI^Hi,,) 

We denote by C H s . r the set of functions with finite norm 

Remark 8.15. Notice that our definition includes the Toplitz and anti-Toplitz functions, 
setting J- = il(-jv g ^ T )f an d hence / = 0. In the case of Toplitz functions one trivially 
haB\\X f § = \\x' f \\i tr . 

8.15.1. Some basic properties. The following Lemmas are proved in [2]. 

Lemma 8.16. (Projections 1) Set p — (s, r, K, 6, fj,, A, O) and p — (s, r, N, 8, p, t) with 
N > K. Consider a subset of monomials I such that the projection (see (25 )) maps 

(86) Ur.Tp^Tp, VN>K. 
Then H, : -> and 

(87) \\X niF §<\\X F §, VFeQT. 

Moreover, if F £ satisfies 11/ F = F, then, ViV > K, Ve > 0, there exists a decomposi- 
tion HN,0,t2,rF = F + N~ idT F with a Toplitz approximation F 6 T p satisfying HjF = F, 
UjF = F and \\X F \\ S:r , \\X p \\ B , r < \\X F \§ + e. 

Lemma 8.17. (Projections 2) For allleN,KeN,N>K, the projections 

(88) n« II| fc | </f ,ri diag :%^Tp. 

here maps to the space of homogeneous functions of degree I, ILjiag := II^II^o. 
IfFeQ^ then, 

(89) ll-^iico .f ll^ > ll^n| fc | <jr F||^ , ll^n diag F||^ < ||-X>||J , 

(90) \\X F (<2)\\p , \\X F -X (< 2 ) \\p < \\X F \\ff. 

r r \k\<K y y 

Moreover, VO < s' < s, set p' := (s',r,K,0,fx,X,O): 

(91) \\X U{k{iK(F) §, < e - K ^'^\\X F § 

Lemma 8.18. Let Q(z) be a quasi-Toplitz diagonal quadratic function in the variables 
z,z with constant coefficients. There exists a diagonal quadratic function Q(z) G 7~ p , 
Q(z): 

(A,p) 

(92) Q(z) = E E Q( A ) z mZ m , 

|pfl<M« T i 
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so that setting: 

N- Mr Q(z) = ll iNie<fl<T) Q(z) - Q(z) , 
for all m which have a cut at t with parameters (N, 9, fj,, r) associated to A one has 

(93) Q m = Q(A) + N~ idT Q m . 
Moreover one has 

(94) \Qml\Q(A)\,\Q m \<2\X Q $, 

and finally for all m,m' with a cut at I with parameters (N, 6, (J,,t) associated to A we 
have 

(95) \Q m -Q m >\ <N- Mt 2\\X q § 

Proof. Since Q is quasi-Toplitz we may approximate it by a function T € T p ; moreover 
since Q is quadratic and diagonal by the previous discussion we may choose T of the same 
form. 

Hence we can we can find a quadratic and diagonal function Q £ T p so that, with 
Q = N 4dT (Il NAlJ , iT Q - Q), we have ||X Q || r , \\X Q \\ r < 2\\X Q \\J. To conclude, by Formula 



(69 1, we have that a quadratic, diagonal and piecewise Toplitz Q is of the form (92) 



Our last statement is proved by noting that (see ( 16 ) for the norm of a vector field) 

|2 



\X Q f r = 2 sup El^l 2 ^ e 2 ^l|^>|Q. 



2 

J I 



by evaluating at zjf 5 := f^e -0 ^ | j\- p r/V2. The same holds for Q and Q. □ 

9. POISSON BRACKET 

9.1. Poisson bracket estimate. Analytic quasi-Toplitz functions are closed under Pois- 
son bracket and respect the Cauchy estimates. 

More precisely fix allowable p = (s,r,K,8,fi,X,0): 

Proposition 9.2. (i) Given Z' 1 ',/*- 2 ' € Q^, quasi-Toplitz with parameters p we have 
that {/^,/^} € Q^,, for all allowable parameters p' := (s', r', K', 6', //, X, O), with p' 
satisfying : 

r/2 < r' < r, a/2 < s' < s, 2k < (/x - n')K' 2 , kC < (6' - 9)K ,4dT °~\ 

(96) e-b-W K m < 1. 
We have the bound 

(97) < 122^S- 1 \\X fW §\\X f(2) § 
where S := minfl — — , 1 — — ) < 1. 

(it) Given / (1) ,/ (2) as in item (i), assume that 

(98) 3 2 2n+8 e <5- 1 ||X /(1) ||T<l/2, 

then the function f^ o^!>* (1) := e tad ^ > - ) (/^ 2 - ) ), /or |i| < 1, is quasi-Toplitz for the param- 
eters p' . More precisely f^ o £ Q^, for all parameters (K' , 0', //) for which 
(99) 

e -( s - s ')(^W^ < i ; 2k < (/i-^)^' 2 ln(^') _2 , ^C< {6' -e)K' MTa ^\n{K')-\ 
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Finally we have 

(100) ||X /(2)o0 ,J|J, <2||A>>|| 



(101) Il^/( 2 )o0* (1) — XfwWf, < 2\\Xf W \\p\\Xf( 2 ) || J 

Proof. In order to prove this proposition we need some preliminaries. □ 



9.3. A technical Lemma. Take allowable parameters p := (N, 9, /i, r) and p' := (N, 8' , fjf, r) 
such that 

(102) nC < (9' - 6)N 4dT - 4 , 2k < (/x - M ')A 2 . 



Let us set up some notation. In Definition |8.1| we have introduced the notion of a 
monomial m = e l ^ k,x ^y l z a z^ of low momentum with respect to the parameters p and 
denoted by Ilj^ the projection on this subspace. Recall that one of the conditions is also 
that \k\ < N, that is it has also a low frequency. 

We shall say instead that m is of N-high frequency if \k\ > N and denote 11^ the 
projection on this subspace. 

We denote its degree in the high variables to be dfr(m). We further set 

m(M) := + ft"), m L (M) := £ |r(j)|(a, + ft)- 

j low 

The projection symbol Hn,0,/j.,t is given in definition |8.2[ 

We use a mixed decomposition / = IIjv^^.t/ + 1^,3"'/ + Hyv/ + Hi?./ where 11^/ is 
by definition the projection on those monomials which are neither (N, 9, fx, r) bilinear nor 
of (JV, 3/i')-low momentum nor of TV-high frequency. 

We hence divide the Poisson bracket in four terms: {•, •} = {•, -} v ' x + {•, -} L + {•, -} H + 
{■, - } R where the apices identify the variables in which we are performing the derivatives. 

We need the following technical lemmas and definitions. 

Lemma 9.4. Consider a monomial M = e^'^Az^ = e^ fe,x ' 'y 3 ' z a z& ' z° n which, with re- 
spect to some parameters p, has \k\ < N, m(A) < fiN 3 + kN. Assume that z m is a high 
variable i.e. |r(m)| > cN Tl . Then M cannot satisfy conservation of momentum. 

Proof. If we have conservation of momentum n(k) + Y) j cr(j)r(j)(aj — (3j) + ar(m) = 0. 
By the triangle inequality we have 

(103) cN Tl < |r(m)| < m(A) + kN < fiN 3 + 2kN < (C + n)N 3 



a contradiction to Formula ( 49 1 . □ 



In order to simplify the notations let us set IT := ,e' ,t , n := 11^,0. p. T - Assume 
/ j f^ are two functions satisfying conservation of momentum. 

Lemma 9.5. The following splitting formula holds: 

(104) n'{/w,/( 2 )} = n'({n/w,n/( 2 )}^ 

+{ii/M , nfc i3M , + {n/w , n L N ^, f^} L 

/( 2 )} + {/«n£/( 2 >} - {n£/« n£/( 2 )}) 
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Proof. We perform a case analysis: we replace each /W with a single monomial to show 
which terms may contribute non trivially to the projection H'{f^\ f^}. 
Consider the expression 

(105) n'{e* (1, ' I )/ 1, / (1) ^ ( %* ,2) ^/ !, z« <2) ^ ,2) }. 

If one or both of the \k^\ > N then one or both monomials are of high frequency and we 



obtain a term in the last line of (104 1. 



Suppose now that l/c^l, \k^\ < N we wish to understand under which conditions on 
the aW,/?W the expression (1051 is not zero. 



For a monomial M := e { ^y a z°'z 13 if U'(M) / Owe must have d H (M) = 2 (plus 
further conditions). For two monomials Mi, Ma we see that each term of {Mi,M2} has 
as degree da ({Mi, M 2 }) equal to: 

i) dff{Mi) + (iff (M2) — 2 if we have contracted conjugate high variables zj, zj° . 

ii) dff(Mi) + dn(M 2 ) otherwise. 

In case i) in order to have LI'{Mi,M2} / we must have dff(Mi) + dn(M 2 ) = 4, 
this happens either if a) d#(Aii) = d H (M 2 ) = 2 or b) d H (M 1 ) = l,d H (M 2 ) = 3 (resp. 
dn(Mi) = 3,dH(M 2 ) = 1). Let us show that, by conservation of momentum, case b) 
is not possible. Let us denote by A^i = 1,2 the part of the monomials Mi in the 
variables z h which are not high i.e. M 1 = e l{ - kl ^y ai A x zf , M 2 = e ii - k ' 2 ' x) y a ' 2 A 2 zJ z^z^ . 
In {Mi, M 2 } the part of the monomial in the variables Zh which are not high is A\A 2 so, 
if n'{Mi, M 2 } / Owe must have m(AiA 2 ) = m(Ai) + m(A 2 ) < fx'N 3 , we are thus in the 
hypotheses of Lemma [974] for Mi a contradiction. 

In case a) we claim that both monomials M lt M 2 are N,/j,,9,t bilinear, so that this 



contribution comes from the first line of (104). For this we only need to verify that, if 



zj is the variable we contract, then j has a cut at I for the parameters N,6,h,t. Write 
Mi = e i(ki,x) y a lAlZ ± z ^ M2 _ e i(k 2 ,x) y a2 A2Z j z cr\ Since n'{Mi,M 2 } / Owe have 

m(A\) + m(A 2 ) < /z'TV 3 . By conservation of momentum |r(j)±r(m)| < fi'N 3 + nN hence 



I j ±m\ < n'N z + kN + Mk, using ( 102 ) we have 

I j ± m\ < fi'N 3 + nN + 4dn < fiN 3 . 



Since m has a £ cut for the parameters /i',9',t and the hypotheses in Formula (54) of 



Lemma |7.19| are satisfied, hence j has the cut and we obtain the first term in formula 



(104) 



In case ii) we can have dn(Mi) + dn(M 2 ) — 2 either if a) dy(Mi) = 2,dn(M 2 ) = 
(resp. riff (Ml) - 0,d H (M 2 ) = 2) or b) d H {M{) = d H {M 2 ) = 1. 

We claim that in case a) we obtain the contributions of lines 2,3 of Formula ( 104[ ). 



In fact say that djy(Mi) = 2,d H (M 2 ) = and Ml = e^ kl ^y a A lZ ^z^ , M 2 = 
e l ( k i - x ) y b A 2 where Ai do not contain high variables. We have that \ki\ < N by hypothesis, 
the high variables of Mi have a N, \j! , 9' , r cut also by hypothesis and so also a N, /1, 9, r 
cut, finally if we contract variables x,y we have m(Ai), m(A 2 ) < mi{Mi,M 2 } < fi'N 3 . 
Assume we contract conjugate variables Zh which are not high, let Ai — BiZj^ so that B\B 2 
is the part of {Mi, M 2 } in the low variables and m(Bi) + m(B 2 ) < /i'N 3 . By conservation 
of momentum for M 2 we have |r(/i)| < m(B 2 ) + kN, hence m(A 2 ) < 2m(B 2 ) + kN < 
2/jl'N 3 + kN. In both cases we deduce that we we obtain the contributions of lines 2,3 of 



Formula ( 104 1 



Let us show finally that, by conservation of momentum case b) is not possible. We 
are now assuming that for instance Mi = e l ^ kl,x - ] y ai Bxz h zf n with z rn high while Zh not 
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high, hence \r(h)\ < cN Tl and m{B 1 ) < fi'N 3 . We have m{B 1 z h ) < fi'N 3 + cN Tl . So by 
conservation of momentum we have. 

6'N T1 < |r(m)| < m(Bt) + |r(/i)| + kN < fi'N 3 + 9N Tl + kN 

which implies (9' - 9)N Tl < f/N 3 + kN contradicting ( |102[ ). □ 

9.6. The proof of Proposition [972], We use all the notations and hypotheses of |9.2| 



We can first use the standard estimates (21 ) and obtain 

(106) \\X {fWtfW} \\* y < 2 2 «+ 3 5- 1 ||X /(1) ||^||X /(2) ||^ < 2 2 "+ 3 «5- 1 ||X /(1) ||I||X /(2) ||T 
here 5 is defined in (22 1. 

Since the allowable parameters K',9',ii' satisfy (|96| we have that ./V, #',//, r satisfy 



fll02"] ) for all N > K',t > r . In order to show that {/ (1) ,/ (2) } is quasi-Toplitz (with 
respect to the chosen parameters), it is enough to provide, for all N > K' and the allowable 
parameters p' := (N, & , //, t) and for e > a decomposition 

so that J 7 ^ 1,2 ' £ T p i and also 

(107) ||^ (1 , 2) |k r ,,||X /(1 , 2) || sV , < 122 2 "+ 3 5- 1 ||X /(1) ||j; r |!X /(1) ||j; r + 0( e ). 

For e > take 6 7^, p = (s, r, iV, 9, /i, r) so that setting 



f (l) ^^(n^/W-^)), 

nua(||X J , w ||., f . ) ||X /w || >iP )) < ||X /W ||£ + . 



(108) 
we have 
(109) 

We thus define the function 

^ = n W)T ({^w, j-< 2 >}" + {^^n^/W}^^ + {n^/ (1) ,-F (2) } (l/ ' x)+i ) 

where we have denoted {•, • }(J'> :r )+ L = {■, .y(V' x ) + {•, -} L . We need to show that, for e 
sufficiently small it satisfies the required conditions. 

Lemma 9.7. (i) One has J^ 1 ' 2 ) e 7^'. 

fuj Setting f (1 ' 2) = N MT (Tl'{fW~fW} - J"^- 2 )) one /ias </ia< i/ie fcouratfe 1107] ZioW. 



Proof, (i) The constraints (I72J) are satisfied and we just apply Lemma 8.10 



(ii) The estimate (107) for J-^ 1 ' 2 -* follows by Cauchy estimates since 

||Jfj7(i,2)|| a ' )r < < ||X{j r (i)^( 2 )}|| s '^' + ||-^{.F(l),/(i>)}||s',r' + \\X{F(2)J-(i)}\\s' : r' 
< 2 2n+3 6 1 [||JCj7(i) || ria ||Xjc(2) \\ r .s + \\Xjr(i) \\ rtS \\X f(2) \\ r ,s + II r,s \\Xjr(2) \\ r>8 \ 



<32 2n+3 5^\\X f(1) \\i\\X f(21 \\i+0(e). 



We now estimate ||Xj(i, 2 ) \\ s ', r '- We have 



/i^) = ^n'({/W ) /( 2 )}4j(V (2) } ff -{J (1) ) 4, 3 ^«}^H^^n^ ¥ /(V (2) } (9 

We substitute in formula ( fl04| IIp/W = J"W + N- 4dT f (l K Thus /t 1 ' 2 ) = II' (S) with 
(110) 5 = [{^W+Ar-^W/^ + i^ 

+{/ (1) ,n^, / ( 2 )}^ + {n^/( 1 ),/( 2 ))}^ 
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[{Ti u N f^j^} + {/« n£/< 2 )} - n^/( 2 )}] . 

In order to estimate the norm ||Xj(i,2) \\ s ',r' we estimate the norm ||XH|| s / >r /. 
The first two lines of 5 are estimated by 9 2 2n+3 5~ 1 \\X fm ||J \\X f & ||J + 0(e) and the 
last by the smoothing estimates (26), ||^n^/lls'.r' < 2e~ N ( s ~ s >\\Xf\\ Str , gives 



ll x {n^/ci),/C2 )}+{/ (i) ) ng/(2)}_ { n^/ti),n^/W}IU' 1 r' 
< 82 2 »+3 e -^-«') ( 5-i||x /C1) || Sir ||X / ( 2) || S!r . 
Since by ((96} N^e-m*-*') < 1, the estimate (fl07|) follows. 



□ 



Proof. Conclusion of the proof of (proposition 9.2) Proposition |9.2[i) follows from the 



previous Lemma. The proof of (ii) is identical to that of Proposition 5 (ii) of [52] . □ 

Part 3. The KAM algorithm 

10. An abstract KAM theorem 

The starting point for our KAM Theorem is a class of Hamiltonians H, variation of 
the Hamiltonians considered in [22] : 

(111) H:=N + P, P = P(x,y,z,z,0- 

Af := (u}(£),y) + ^ ^k(0\ z k\ 2 + X] a h,k(\zh\ 2 ~\zk\ 2 ) + b h ,k(zhZk+ZhZk), 
fees,-; (h,k)esf 

defined in D(s,r) x O, where O C A e 2 is a compact set of diameter of order e 2 . The 
functions O n (£), a^fc, bh } k ar e well defined for £ e O. In our examples the set S% of 
complex eigenvalues or hyperbolic terms is finite. 

It is well known that, for each £ € 0, the Hamiltonian equations of motion for the 
unperturbed TV admit the special solutions (x, 0, 0, 0) — > (x+uj(£)t, 0, 0, 0) that correspond 
to invariant tori in the phase space. 

Our aim is to prove that, under suitable hypotheses, there is a set Ooo C O of positive 
Lebesgue measure, so that, for all £ 6 the Hamiltonians if still admit invariant tori 
(close to the ones of the unperturbed system). 

Given a value £ of the parameters we have the torus given by the equations y = z = 0. 
If the Hamiltonian vector field Xh of a Hamiltonian H is tangent to this torus, and if 

d 

on this torus it coincides with X^:=i Ct(£) o — then the Hamiltonian evolution is quasi- 

periodic on this torus. In term of H this amounts to say that, for this value £ we have 
that the term is constant (and we usually drop it), the term H^- ' 1 ) = and finally 

jj(i,o) _ In this case we shall say that £ defines a KAM torus for i?. 

Therefore our goal is to find a change of variables (possibly in a smaller domain) so that 
we have a large set Coo of parameters defining KAM tori for H . The precise statement is 
contained in Theorems [5] and [6j 

We start by describing the class of Hamiltonians to which the method applies. 

It will be useful to be more explicit in the representation of the low order terms: 

(112) = f°(x) + f l (x; w) + f(x; y, w), 
the apex represents the order. 



3(5 



M. PROCESI*, AND C. PROCESI**. 



10.0.1. Compatible Hamiltonians. We consider a class of Hamiltonians stable under the 
KAM algorithm. 

In the construction there will appear parameters 

p = {r,s,K,6,fi,O,a,S ,M,L), (e, 7 ,6) 

playing different roles, where O C A e 2 has positive measure (of order e 2n ) while all the 
others are positive numbers such that K > Nq,"/ < 1 and: 

- c)K T °, (C - 6)K Mto > kK 4 , 

(113) 7 < 2e 2 M < 1 , (8Me 2 ) -1 > S„ > Ay/nML , a<M, LM < 4 
recall that k = maxdjil) and see ( [49] ) for the definition of N a , c,C,t . 

Definition 10.1. We consider Hamiltonians, defined in D(s,r) x O, of the form: 

(114) H :=Af + P,Af:=(Lu(0,y)+ £ fik(0N 2 +C P = P(x,y, z,z,£), 

n 

C= ^ a h,k(\ z h\ 2 - \ z k\ 2 ) + b h: k(z h z k + z h z k ), (a>(Q) y) = y]wj(£)yj. 

(h,k)eS? i=l 
We also may use for the imaginary eigenvalues the complex coordinates, in that case 
unless there is a risk of confusion we may write the Hamiltonian in full diagonal form 
Af := y) + YJfceS c ^fc(£)l z fc| 2 with the understanding that finitely many z k are 

complex coordinates which come in pairs and that the corresponding Ofc(£) are complex 
and in conjugate pairs. 



We say that a Hamiltonian (114) is compatible with the parameters p if the following 
conditions (Al)-(/14) are satisfied: 

(Al) Non-degeneracy: The map £ — > w(£) is a lipeomorphisnj^jfrom O to its image with 
< L. Setting Vj := |j;| 2 , for i = 1, . . . , n we have |w(£) — vj^ < Me 



2 



(A2) Asymptotics of normal frequency: For all n € S c we have a decomposition: 

(115) n n (0 = o-(n)(\r(n)\ 2 + 20 B (O) + ^(0- 

We assume that the are chosen in a finite list of analytic functions which are 

homogeneous of degree one in £, moreover the f2 := {f2 n }„ g 5c are Lipschitz functions 
from O -> witlff 

(116) M'* + |n|S*<M and 2|0|£<M, 2|i?U < Me 2 . 

(A3) Regularity and Quasi-Tdplitz property: the functions P, (t?z, z) := X^j^jNil 2 an d 
(fiz, := ■ rijjzjl 2 are M-regular, preserve momentum as in (44|, are Lipschitz in the 



parameters £ and quasi-Toplitz with parameters (K,9^fx) (cf. Definition 8.13). Moreover 
for all N > K, r < r < n/4d we have II(iv,( AT ) ^ &j\Zj\ 2 € T(n,0,ii,t)- 

We need to control the norms of the above functions, we use the free parameter 7, 
whose purpose is to estimate the measure of the various Cantor sets which will appear, 
and set p= (s,r, K,0, fi, A = 7M~\(9). 

^in our applications all maps will actually be analytic 
^recall that on R n we use the l°° norm. 
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We define: 

(117) 7 - 1 ||X p(l) ||J:= £ W, i = 0,l,2, e=(e(°),eW,e( 2 )), y'^Xp^ := Q 
We require that 

(A3*) Smallness conditional 

(118) 0<1, J-'WX^W^ <1, ne\e]K^<l. 

We note that from condition (A3) and by Remarks 8.15|3.4| we have that 



\x { ^ z) § = \\x^ ZiZ) \\i r = |tfu+ m i g = i^+jm- 1 ]^. 



Then ( |113[ ) and ( |116| imply \\X {i} ^ z) \\l < 2Me 2 



(AA) N on- degeneracy (Melnikov conditions): We havep*] 

(119) inf \A i Mw,k)+Q.-l)\ >a, Vfc e Z", / e Z s " , \l\ < 2 , \k\ < S , (k,l)^0, 

for all (k, I) compatible with momentum conservation and such that, setting V„ := o~(n)\r(n)\ 2 , 
one has: 

(120) (v,fc)+V-Z = 0. 
Observe that (v, k) + V • I e Z. 



Remark 10.2. If H is compatible with the parameters p it is also compatible with all 
choices of parameters p' where 

s' <s, r' <r, K' >K, O' C O , 



provided that (118) still holds 



In working with the cubic NLS we will also have (at the first step of the algorithm): 

(A2*) Homogeneity: The functions — v, fl m — <j(m)|r(m)| 2 are analytic and ho- 
mogeneous of degree 1. 

Remark 10.3. By the homogeneity of w(£) and # we may easily see that M,L,a can be 
taken e independent. If the condition (A2*) were that the functions are homogeneous of 
degree q > 1 then we would only have ML, Ma -1 as e independent constants. 



10 by |e| := |e (0) | + + |e (2) | we mean its L 1 norm note that |e| < 30. 
11 we denote by A (tV f = tMhlM 
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10.3.1. Infinite dimensional KAM theorem. Now we state our infinite dimensional KAM 
theorem. 



Theorem 5. Assume that a Hamiltonian Ao + Pq in (114) * s compatible with the param 



eters po — (ro, sq, K,9q, fiQ,Oo, ao, So, Mq, Lq), i.e. satisfies (Al — AA), (^42*), that: 

(121) C -Y' c > 29 o, \O \=As 2n , M L = 2, S = 16Vn, 

and that K is sufficiently large (depending only on the remaining parameters po). 

There exists < 1 and a positive constant B such that if furthermore Oo < Q and for 
all 7 such that Bj < As 2 , we may construct: 

• (Frequencies) Lipschitz functions u^, : Oo — > K™ , f^oo : Oo — > ioo, satisfying 

(122) Iu^-ujoW l^-fi^l^ < 7 |eo|6- 1 



where eo is defined by Formula (118) and lui^ 119 , jfiool^P < 2Mq 



(Cantor set) A Cantor set 



o ' 

fte smallness condition on 7 ensures that Ooo has positive Lebesgue measure. 



(123) |Oo\Ooo|<Se 

a 



• (KAM normal form) A Lipschitz family of analytic symplectic maps 

(124) $ : D(s/A, r/A) xO M 4 D(s, r) 

of the form $ = I + \& im£/i ||$|| r /4,«/4 < |eb|; where Ooo is defined in the previous item, 
such that, 

(125) fl oo (-;0:=-ff o *(-;0=w oo (0l/oo+n oo (02oo2oo+i ,o ° k»a P<2 = 0- 



Formula (125) tells us that the final Hamiltonian has invariant tori parametrized by 



Remark 10.4. We will use the freedom given by Remark 10.2 to choose K large enough 
so that some necessary bounds, which appear in the proof, hold. 

It is important to notice that all the conditions which we shall impose on K are inde- 
pendent of r and O - 

Theorem[5]is proved by an iterative procedure. We produce a sequence of Hamiltonians 



H u = N v + P v , each of these Hamiltonians will satisfy the properties (A*) of Section 10.0.1 
for suitable parameters p v . In particular for the compact sets O v we have the telescoping 
O v C O v -\. We set O^ — r\„O u and the construction will be such that O^ has positive 
measure. 

We have a sequence of symplectic transformations $„ : H v -\ := H v , where is the 
value at 1 of the flow generated by the Hamiltonian vector field Xp u _ x associated to the 
generating function F v ~\ determined as the unique solution of the Homological equation, 
and depending on P~\. The transformation is well defined on the domain D(s v , r v ) x O v , 
with Z?(s„,r„) C D(s u -i, r„_i). At each step, the perturbation remains bounded while 
the part P^ 2 becomes smaller. We also denote D{s v ,r v ) = D u . The Topliz norm of a 
function G defined on D v x O v and relative to the parameters extracted from the p v will 
be denoted by \\G\\ V . 
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The goal is to pass to a limit Hamiltonian Hoo = Moo + -Poo with the property that 
P^ 2 — so that the Hamiltonian vector field on the family of tori parametrized by the 
parameters £ € Ooo, where the normal coordinates are 0, coincides with the Hamiltonian 
vector field of Moo • 

The relevant estimates to be performed are the following. 

• We have to estimate the norm of each F„_i so to make sure that the value 

at 1 of the flow generated by the Hamiltonian vector field Xp u _ 1 is well defined 
on D y x O v . 

Here the problem is that of small denominators since we have to divide by 
eigenvalues. Here the quasi-Topliz properties play a major role. 
The key is Proposition 1 1 . 1 5} 

• While we establish the previous item we have to estimate the measure of the set 
O v , Lemma [TOTOl 

• We have to perform all the estimates on the new parameters p v , this is done in 
gOT7} 

• We have to estimate the norm of the part Pjf 2 , for this we have to control simul- 
taneously the three parts in which it naturally decomposes. 



We need to prove that the set Ooo has positive measure, Corollary 10.21 



We need to prove that on the set Ooo we have a limit change of coordinates 
giving rise to the limit Hamiltonian |10.21| 



10.5. KAM step. 

10.5.1. Formal KAM step. The input of a KAM step is a Hamiltonian of the previous 
form and a frequency cut K. The output is a subset + C O of the parameters £, two 
new values for the radii of the domain s + < s, r + < r and a symplectic transformation ^ : 
D(s + ,r + ) xO + -> D(s,r) of type * = e ad( - F \ so that finally we have a new Hamiltonian 
H + = H o >]/ which we expect to be a simplified version of H so that, after iterating 
infinitely many times the KAM step, we hope to arrive at the desired final Hamiltonian 
which shows the existence of quasi-periodic orbits as in Theorem [5] 

The function F is obtained by solving the homological equations. In order to explain 
this it will be convenient to write explicitly the terms of P^ (x,y,w): 



pM( X;W )= e p i:L^ i(k ' x) ^ 

mGS c , (7— ±, k 
k k 

Only those terms which satisfy conservation of mass and momentum may appear. We set 

(126) [p^ 2 ] = (pr\y) + E p &,m,-\^\ 2 +v. 



Where V is diagonal only in the complex notation and arises from the term C of the 
normal form Af (see ( 114[ )). 



On the space of quadratic Hamiltonians ad(J\f) has a basis of eigenvectors described 



in ^6.4 On the space relative to the non-zero eigenvalues ad(Af) is formally invertible, 
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hence for those £ for which the Melnikov resonances do not occur, [P- 2 ] is the projection 
of P<k on the kernel of ad(N). We define 

(127) F :^ad(N)- 1 {Pf K -[P^ 2 ]) => ad{F)N = [P^ 2 ] - Pf K 

and since ad(A/") _1 is diagonal (at least in complex coordinates) this definition can be 
given degree by degree, thus defining F°, . Notice that even if we use complex 

coordinates F is always real. 



10.5.2. Estimates. Formula (1271 defines F as a formal expression. We now impose a 



lower bound on the eigenvalues of ad(Af) on the space of functions of degree < 2 which 
implies that F is analytic. Let us restrict our attention, for instance, to the set O of 
(eO such that: for all /c € Z™, |fc| < K and I € Z s , |Z| < 2 which satisfy momentum 
conservation, we have 



(128) 

Lemma 10.6. 

(129) 



Kw,fc) + (j,n)|> 7 jr 



-2dn 



We have: 



s,r.O 



< K 2dT ^- l \\X! 



A 

\s.r,0 > 



0,1,2. 



Proof. We first notice that ( 128 ) implies that P< K — [P- 2 ] is a sum of eigenvectors of 
ad(N) with eigenvalues bounded from below (in absolute value) by jK~ 2dTl , therefore 
since we are using the Majorant norm we have 



|A 



< K^j^WX 



^p(i)-[P(i)]\\s,r 



7 



I A" 



pw l 



i = 0,1,2. 



□ 



Then by Proposition 3.3 F defines a symplectic transformation e^ ad ^ F ^ provided that 
2 2rl+3 ei5 _1 ||Xi?|| s . r < 1 holds for a suitable choice of s',r' and possibly restricting to 
a subset of the parameters. More precisely in the next paragraph we will define a set 



C + C O (see Definition 10.8) and show in Lemma 10.10 that, provided 7 is sufficiently 



small, this set has positive measure. On this set we shall prove in Proposition |10. 11) that 



the inequalities (1281 hold. In order to iterate this procedure we need to be sure that F 
is quasi- Toplitz and estimate its norm, this will be proved in Proposition 10.15 So for 
the procedure to succeed we need that the perturbation P^ l \ i < 2 be rather small. A 
sufficient condition of smallness, verified by the NLS, is discussed in Theorem [6| 

10.6.1. KAM step. For simplicity, below we always use the same symbol cost to denote 
constants independent on the iteration. 

We now start from a Hamiltonian in the class of Dcfinition |10.1| and describe a procedure 
which produces a change of variables under which the Hamiltonian is still in the same 
class with new parameters which we estimate explicitly. 



KAM Step 1. (1) We Define in \ 10. 8[ a compact set + C O such that 
(130) 



\0\0+\< Tl £ 2{n - 1) K- T0+n+d / 2 , 



where T is a constant depending only on n,d,K. 



(2) We construct, by Formula (127), the function F. We prove in Proposition 10.15 



that F is quasi-Tdplitz with parameters K, 9, /i for all £ €E 
For all positive numbers r + < r and s + < s for which: 



(131) 



3 2^ +B K(min(l - 



l-'-±))-i e \e]K- < , 
s 2 
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we show that F generates a 1 -parameter group of analytic symplectic transformations 
: D(s + ,r + ) —> D(s, r), well defined for all t, \t\ < 1 and for all £ <= + . 



(3) Applying Proposition 9.2 ii) with p' — (N+, 6 + , s + , r + ) we show that QpH := 



H+ = A/+ + P+ is quasi- Toplitz for all choices of parameters K + , 6 + , fi + satisfying ( 99 1 



The core of the construction is to compute parameters M+, L+, a + , see (155), and 



0+, e + , see ( 160 ) relative to the new Hamiltonian. The iterative KAM algorithm is based 
on the fact that if O in Theorem [5] is small enough then H + is compatible with the 
parameters p + and respects the smallness condition (A3*), so one may iterate the step. 

10.7. The set 0+. 

Definition 10.8. C + = C+ l7 is defined to be the subset of £ £ such that: 

i) For all k G Z™, |fc| < K and h £ Z, so that (h, k) ^ (0, 0). 

(132) \(uj(0,k)+h\>2jK- To . 

ii) For all k £ Z n , \k\ <K,m£ S c , with ir(k) ± r(m) = 

(133) \{u{£,),k)±n m \>2 1 K- Ta . 

hi) For all |fc| < K,m,n £ S c min(|r(m)|, |r(n)|) < CK Tl and 7r(fc)±(r(m)+err(n)) = 
with cr = ±1: 

(134) |<u;(0,fc)±(n m + c7Q n )| > 2 1 K- 2dT K 

For all |fc| < K,m,n £ S c |r(m)|, |r(n)| > CfT Tl and 7r(fc) ± (r(m) + r(n)) = 

(135) |(w(0, *) ± («m + On) I > 2 7 ^- 2rfri 

iv) For all natural TV with K < N < (cC" 1 ) 1 / T °if ri / T °, for all affine spaces [uijp^ in 
"Hat (1 < ^ < ei) with < cN Tl / 4d we choose a point e A := [^;p»]f with 
|r(m,4)| > CN Tl . For all such and for all k such that |fe| < K, we require: 

(136) |<w(0» *> + *W - a* I > 2jmm(N- 2dT °,c 2 % 2d ), 
for all n such that r(n) = r(m>i) + 7r(/s). 

In order to analyze C+ we need a first Lemma for the measure estimates. We define 

n,i ■■= {e g 01 Kw, fc) + (/, o)i < 1 k- t ) , iez sc . 

Lemma 10.9. For all (k,l) 7^ (0,0) |fc| < K and \l\ < 2, which satisfy momentum 
conservation, one has 

(137) \ni,\<^e^ n -^K- T . 

a 

Proof. Let us first state a general fact. Let / be a Lipschitz function on the domain 
O C A £ 2 such that 

■ ■ .,£n) - ■ • -(fn)l > - y| 

for all x ^ y such that (x, £2, • • • , 60, £2, • • • , fn) € C 

We consider the map F : (£) 1— > (/, £ 2 , ■ • ■ which maps bijectively to some set 
B. F is a lipeomorphism and its inverse has Lipschitz constant < a -1 . In B, f is a 
coordinate and the level surfaces of / are contained in a hypercube of volume e 2 '™ -1 ) 
therefore the volume of the set where |/| < c can be estimated by 2£ 2 ^ n ~ 1 ^c hence on O 
it can be estimated by 2a~ 1 e 2 ( n-1 )c. 



42 



M. PROCESI*, AND C. PROCESI**. 



If |fc| < So, we have assumed that, for all functions fk,l(Q = (&,k) + (Z,f2(£)) which 
satisfy (1201 we have infj^go \A^ iV fk t i\ > a. So we may apply the previous argument. 
If on the other hand fk,i does not satisfy (1201 (but |fc| < So) then we may write 



/*,i(0 = n + *M(0> 



where n is the non-zero integer computed in ( 120 1 and 



l-Ffc.il < Kw(0-v,fc)|+4sup|i? n (0|+2|n| 

n 

by hypothesis |fc| < So (in particular So > 1) while 2|#| 0O , 
hypothesis (^43*) we have 

| J < 7 < 2Me 2 

(So 



v|oo < Me 2 . Finally by 



101 



- \\ x {az, z ) 



So, by (113), we have > |n| 



4) Me 2 > i and we may deduce that RJ, l is 
empty. Finally if |fc| > So then we change the variables form £ to oj and study G^^iw) := 
(ui,k) + (Cl(£(u))),l). Let e k be the versor of k, we may perform an orthogonal change 
of variables in u so that e k is the first vector in the standard basis. Then we repeat our 
argument with respect to u, indeed 

\(m")),i)\ l £<\m\ l £\"- l \ l £<ML 

so that 

t G k .i(x,uj2, ...,ui n )- G k j(y,u> 2 , . .. ,ui n ) , ^ |fc|_ 

/n 



> 



2ML > 2ML . 



x-y 



in lu(0) (recall that |fc| > S > A^/nML 



for all vectors (x,u>2, ■ 
by hypothesis). 

Therefore the volume of the set where IG^^o;)! < c can be estimated by (MIJ'^Me 2 )'™ -1 ^. 
The corresponding volume in the space of the parameters £ is therefore estimated by 
i n (Af J L)- 1 (Me 2 )("- 1 )c = (LAf) Tl - 1 Af- 1 e 2 ("- 1 )c. Since ML < 4 the estimate follows. □ 

Lemma 10.10. The set C + is compact and one has 



(138) 



\o\o, 



< COSt -£ 

a 



Proof. Since k runs on a finite set the functions |(w(£),£;)| are bounded on O, hence the 
first formula (132) is satisfied for \h\ large, for instance \h\ > 2\w\ 00 K. So (132) is actually 



implied by a finite number of inequalities. 



Formulas ( 133 1 and ( 134 ) are a finite number of inequalities by definition. 



Formula 



( 135 ) is a priori an infinite list of inequalities, we note however that | (w, k) ± (Cl 
\u\ooK is large when |f2 OT + f2„| > 2|w|ooiir. 

Next J7 m has an integral part cr(m)|r(m)| 2 where cr(fc) = 1 since |r(m)| > CK Tl . This 
implies that Q m + Q„ is large, and hence no condition is imposed, except possibly for 
finitely many values of m, n. Finally ( |136[ ) is given only for finitely many elements; notice 
that in iv) for each [vi;pi] 9 e and k we impose only a fixed finite number of conditions by 
choosing a point tua and a type u — r(n) — h € Z. Finally by Remark 7.8 there are a 
finite number of [vi]pif t . Thus 0+ is compact. 

Let us prove the measure estimates; we use the symbol < to mean that we have < cost 
where cost depends only on n, d, k. By definition G + is obtained from O by removing a 
finite list of strip s lZ T n a where a runs in a suitable set of pairs k, I. Denote by W A := U a eAT^a 



and, by Lemma 10.9 we estimate \TZ A \ < |^4|7a 1 e 2 (" T . The Lemma will thus 



follow from an estimate on the cardinality of A for the various cases considered. 
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Recall that the elements m G Z d with <r(m) = — 1 are a finite set of some cardinality 
depending only on k, n similarly their norm can be bounded by some R of the order of 
k 2 . By Hypothesis (Al), \oj\oo < k 2 + 1, and we may assume that K is large so that 
R < y / 2(n 2 + 1)K. 

i) As previously remarked we have to impose (132 1 with \h\ < \oj\oqK < (k 2 + 1)K we 
have: 

A Q := {(k, h) |, \k\<K, \h\ < (k 2 + 1)K], \A \ < ( K 2 + l))(2K) n+1 
\K T Ao \< e 2( "- 1) 7a- 1 -ftT- T0+n+1 . 



ii) In (133), by momentum conservation I = ±e m implies that ±r(m) = — 7r(fc). Hence 
to impose (133) we have to remove the list indexed by A±: 

A 1 := {(k,l) \,\k\ <K,l = ±u m , 3uE Z :m + u = T^(k)} , \A X \ < (2K) n d. 

\TZ T Al \< s 2 ^ n - 1 ha- 1 K- T0+n+1 . 

iii) If I = ±(e m + e n ) and a(m) = cr(n) = 1, the index (k,l) can contribute only if we 
have the condition 

\±(w,k) + \r(m)\ 2 + \r{n)\ 2 +$ m + i3 n + n m + n n \ < -. 

This condition implies | ± (u, k) + |r(m)| 2 + |r(n)| 2 | < 1 hence |r(ro)| 2 + |r(n)| 2 <2\u\ 00 K. 
Setting 

A 2 := {(k,l) \,\k\ <K,l = ±(e m +e„) , |r(m)| < ^2(k 2 + l)K , 3u, v e Z : m+n+v+u = 
\A 2 \ < {2^2(k 2 + 1)K + l) d d 2 , \TZ T M | < £ 2(n-l) 7a -l K -T +n+d/2_ 

iv) Setting 

A 3 := {(k,l) |, \k\ < K , I — e m - e„ , |r(m)| < CK Tl , 3u, v e Z : m-n + u-v = T^(k)} 
One has 

\A 3 \<K dTl+n \K 2 A d p\ < 1 a- 1 e 2{n - 1) K- dTl+n . 

v) To deal with the last case, for all natural N such that K < N < (Cr 1 c)lT ri /" ro , for 
all affine subspaces with pi < CN^s , for all \k\ < K and for all types u E Z we set 
n = r(m) + u + 7r(/c) and define 



(139) KAvrMuu : = I K w > fc > + ^ - n fl | < 2 7 min(7V 



Following Lemma 



10.9 



l^fr i I < 7a _1 £ 



^ 2dT \c 2d p- l 2d )} 
1 - Jl " 1 )min(7V- 2 ^,c 2ti p7 2d )- 



We distinguish the two cases. First when min(iV 2rfr °, c 2d p^ 2d ) = N 2dT ° we are in 
Af := {(k, [v 4 ;pj<,u)||k| < if Juijpi^ EUn- Pl< cN T ° , u E Z} 
when min(iV" 2dTo , c 2d pj 2d ) = c 2d p~ 2d we are in 

:= {(fc, ||fc| < JT, \vi\pi\t € Hjv : c7V r ° < Pe < CN™ , u e Z} 

By Remark pfTs] we have |Af | < A^+roHd-iH™ hence 

|7^« | < la ^ £ 2(n~l) N d(d-l)+T (-d-l)+n < Ta -l £ 2(„-l)^-rfro 

as for we use again Remark 7.8 to bound with {2nN) d (2p) d ~ 1 the number of subspaces 
with given p = pe for all I. 

-2d-l+djyd 2 j^n ^ ^ a - 1 £ 2 (™- 1 ) jy—dr +d 2 j^n 



\K A »\< ia -^-^ Y, E : 

AT>K p e >cN T o 



N>K 
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< 1 a~ 1 £ 2(n ~ 1) K~ dTo+d2+n+1 

(from (49 1). Summing all these contributions, since we can bound all the factors by 
e 2 (n-i) -l^-m+n+d+i Qur L emma Jg p r0 ved. □ 



We arrive now to the key estimate which handles small denominators. 

Proposition 10.11. For all £ G +! for all k G Z n , \k\ < K and I G Z 5 " , \l\ < 2 which 
satisfy momentum conservation, we have 

(140) \{u,k) + (/, 0)| > 1 K- 2dT K 



Proof. By Definition 10.8 the cases i), ii), iii) follow trivially since 2dri is large with 
respect to tq. 

We are left with the case £ — e m — e n with |r(m)|, |r(n)| > CK T1 . To start we have 
\(u,k) + Q m - n B | > \(u,k) + |r(m)| 2 - |r(n)| 2 | - 4|0| c 



2101 



r(m) the type of m, we 



We bound the two terms 4|tf|oo + 2|fi|oo| < Me 2 + AMe 2 by We need to estimate 

\(ui,k) + |r(ra)| 2 — |r(n)| 2 | and first note that, setting v 
have: 

(141) 



(w, k) + |r(m)| 2 - |r(n)| 2 = (u, k) - |7r(fc)| 2 - 2<7r(fc), r(m)) = 

(w, fc) - K(fc)| 2 + 2{ir(k),v) - 2(7r(fc), m). 

Note that 7r(fe) G Bjf U {0}. Let m^fu^pj], we distinguish two cases: p\ > CK 4dT ° or 
pi < CJf 4dT0 . 

Case 1, pi > Cif 4dro . 



If 7r(fc) ^ and pi > Cif 4dr ° then by definition | (n(k), r(m)) | >pi- 

Kw, A) + n ro - fi n | > Kw.fc) + |r(m)| 2 - |r(n)| 2 | - 4|^U - 2|fi|, 
CK 4dT " - \uj\K - k 2 K 2 - 2e 2 M - AMs 2 > 1. 



> 



If pi > CK 4dT ° then m has a cut at £ = 0. By Lemma 7.29 we have that m is on the open 
stratum and r(m) = m. So if 7r(fc) = we have m = n and the denominator is covered 
by the bound ( |132[ ) with h = 0. 

Case 2, pi < Cif 4 ^ . 

By hypothesis the point m has |r(m)| > Ci4T Tl and pi < CK 4dT °, thus by Proposition 
m belongs to some A 9 where A^[vi;pi]i, 1 < £ < d. Note that n = r(m) + 7r(fc) + u 



7.12 



r(myi) + v and consider k,A,n 



and m = r(m) + v. 

In A we have chosen a point mj, write tua 
t(i71a) + 7r(fc) + u for the two types u,v G 2. We will show that the inequality (136) 
implies that ( 128 ) holds in our case. We prove this fact by using the hypothesis that 
(A3) - (A3*) hold for O. 

By definition, for all c < 9, [i < C, all m G A 9 have a cut with parameters (K, 9, /i, r(pi)) 
where K T ^ Pt ^ — max(c _1 p£, K T °). Let us first notice that ( 128 1 with I — e m ~ e„ is surely 
satisfied if | (vr(fc) , r(m))| > K 3 because in that case ( |141 ) is greater than 2K 3 — n 2 K 2 — 
\oj\K- 8dK > K 3 by assumption ( flT3j >. 

If on the other hand \(n(k), r(m))| < K 3 , then 7r(fc) G is in (Uj)^. In fact otherwise 
we would have |(7r(fc),m)| > cK 4dT ° by definition of A 9 , hence |(7r(fc), r(ra))| > cK 4dT ° - 
2dn 2 K > K 3 by Formula (11131), this is a contradiction. 
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Note that for all m £ A 9 the type v = m — r(m) is fixed by Theorem [4] so by the fact 
that ir(k) £ follows that (7r(fe), r(m)) = (7r(fe), r(m^)). Recall that is one fixed 
point in A 9 on which we have imposed the non-resonance conditions ( |136[ ). Thus for all 
m £ A 9 either ( 128[ ) is trivially satisfied or 

|r(m)| 2 - |r(n)| 2 = -\n(k)\ 2 - 2(n(k),r(m)) = -\*{k)\ 2 - 2(n(k),r(m A )). 



If A^>[vi] Pi]i by Lemma 

eter r(p e ). By (A3) we know that Unm,^,t X)„ "&n\z n 
hence for all m £ A 9 , d m — d mA . We deduce that if n(k) £ (vi)t 



7.17 



all in £ A 9 are also in A 9 K g , . for some suitable param- 
2 is Tdplitz, for all allowable 9,p,,r, 



n«l. 



By (A3) we also know that Sl(z) := ^2 m f2 m |; 
which satisfy (1131 hence, we may apply Lemma 8.18 with Q(z) = Cl(z). Lemma 



is quasi- Toplitz with parameters K, 9, p, 



7.17 



insures that all m £ A 9 satisfy the conditions of this Lemma with N 
needed in order to have the estimate (95) with r = r(pe): 



K, which are 



„, li <2\\n\$K-<* r M. 



We proceed in the same way for n = m + ir(k) — v + u. Let n—> [wt ; qi\ , we notic e that n £ 
Bx'j (m) and by hypothesis ([i—c)K T , (C—9)K 4dT > kK 4 hence, by Lemma 7.22 n has an I 
cut [wf, Qi]i with parameters N, 9, /i, r(jpi). Now, by Lemma 7.19 ii), [wf, q^\i is completely 
fixed by A and 7r(fc) — v + u. This shows that setting h = rriA + 7r(fc) — v + u — m,A + n — m, 
both n and n have a cut with the same parameters and the same associated subspace 
[wi\qi\£. Since Hn,6,h,t 5Z« ^nl z n| 2 IS Toplitz, this means that d n — Similarly we 
have 

\n n -n n \<2\\n§K- 4d ^\ 

In conclusion when Tr(k) £ (vi)i we have 

\n m - q„ - n mA + ry < 4\\n§K- UrM , 

where by definition K T ^ — max(AT T ° , c~ 1 p). We now apply the constraint (1361 and 
hence: 



(142) 



\{<j,k) + n m - n„| > \(cj, k) + fi mA 



Hal - 4||0||^- 4dT ^ > 



2 7 min(^- 2 ^, c 2d p- 2d ) - 4||Q||J min(A- 4dr °, c 4d p~ 4d ) 



now, by (1181, < 7 and we get 

\{u,k) 



fin I > 7min(^- 2dro , c 2d p7 2d ) > 7 iT 



2dri 



since p e < CK T ^ 4d =*> c 2d pj 2d > (c(T 1 ) M - f£: ~ Tl/2 and (cf. (49)) X > A > Cc" 1 
implies (cCr 1 ) 2 ^^ 1 / 2 > Jf-n/2-2d > ^-2dn _ 

□ 



Remark 10.12. This Proposition essentially says that, by imposing only one non resonant 
condition ( 136 ), we impose all the conditions ( 128 ) with I = e m — e n such that m £ [vi\p^ 9 
and n = m 



■7r(fc). 



Remark 10.13. Notice that up to now we only use (1361 with N = K. Indeed the 



other non-resonance conditions are only required in order to show that the quasi-Toplitz 
property is preserved in solving the homological equation. 
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10.14. The Toplitz property for the generating function F. The function F has 



been obtained by solving the homological equation and given in Formula (127). We now 
prove: 

Proposition 10.15. For £ 6 + the solution of the homological equation F is quasi- 



Toplitz for parameters (K,8,/i), moreover one has the bound (cf. (117) and recall p - 
(s,r,K,e,n,\ = 1 - 1 M,0)): 

(143) \\XfM§ < J-^WXp^ = Kc«, K = K 2 ^/ T ". 

Proof. We have already given in Formula (129) a better bound on the norm \\X F <r 11 1 



hence to prove our statement we only need to consider the quasi- Toplitz norm Wi^'^ 
The quasi-Toplitz property is a condition for N > K, on the (N, 0, fj,, r)-bilinear part 
of F&. Hence if i = 0, 1 the Toplitz norm coincides with the usual majorant norm and 



( 143 ) follows from the bounds ( 129 ) 



We are reduced to proving our statement on the quadratic terms: 

Tim a >F (2) - Fi p^*'*)* z +Bu P ^ k ' x ) z z +Ci p^-^z z 



\k\<N , 

»(|r(70|,|r(m)|)>fliV- 



with 
(144) 

i-i _ Pk,0,e m ,e n „ _ Pk,0,e m +e n ,0 r< _ Pk,0,0,e 



(k, co) + Q m — f2„ ' : ' (uj, k) + £l m + fl n ' ' : (w, k) — fl m — f2 r , 
By hypothesis min(|r(m)|, |r(n)|) > 9N Tl so in the case of Bk, m ,n one has 



|-Pfe,0,e m +e„,o| 



< C- 1 |Pfc,0,e m+e „, |^ ri , 



|(fc,w) + |r(m)| 2 + |r(n)| 2 + 2An + 2*„ + r! m + a-, 
since 

\(k,Li) + \T(m)\ 2 + \r{n)\ 2 + 2$ m + 2$ n + n m + n n \ > 2c/V Tl - \u\N-4\0\n +2|OU > c/V ri . 

Since N 4dT ~ Tl < 1 this means that ^ fc m n B^m^e'^^^mZn is quasi-Toplitz, and we may 
take the "Toplitz approximation" equal to zero. The final estimate follows by formula 



fl!29D 

II ]T B k , m ^ k ' x) z m z n \\l?^ < ma X {K M ^'y- 1 1 c- 1 )\\X pm \\i tr = K^^WXp^. 

k,m,n 

Same argument for J2k mn ^ k 
We thus have to study J2km.n F k 

Take N > K, denote by p := iV, 6*, /x, r, we wish to decompose 

(145) F fc , m , n = J" fe (m-n,[ Ui; ^] £ ) + iV- 4dr F feim! „, 

so that J-'fc is the A: Fourier coefficient of a Toplitz approximation T € 7^. 

By momentum conservation we have 7r(fc) + r(m) — r(n) = hence |r(m)| 2 — |r(n)| 2 = 
-|7r(fc)| 2 -2(7r(A:),r(m)). 

By assumption 

|r(m)|, |r(n)| > 8N Tl , m-t[vi;pi] , n^>[wi;qi\, 

(146) q e , Pe < ixN T , qt +1 ,pt+i > 9N Mt , A := -< B := [w^q^ , 
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namely m,n have a cut at I with parameters (N,9, h,t) and hence, by Corollary 7.20 
(vi, . . . ,vg) — (wi, . . . ,w e ) . We distinguish two cases: 

Case 1, ir(k) ^ (vi)i. If 7r(fc) ^ (wi)^ then by the definitions of cut 7.15 of optima 
presentation 

|(7r(fc),r(m))| > \(w(k),m)\ - 2dnN > 9N 4dT - 2dnK > cN 4dT /2 
we have 2|(7r(fc), r(m))\ > cN AdT . For the denominator in the first term of (144 1 
(k, uj)+n m - 0„ = (k, u) + \r{m)\ 2 - \r{n)\ 2 + 2d m - 2d n + & m - Q n , 



\(k,uj)+n m - Q n \ > |2(vr(fc),r(m))| - |(fc,w)| - |7r(fc)| 2 - 4|0| c 



2101 



> c7V 4ar - \u\K - k z K z - 4^U - 2|n|oo > 2^ 



4(/t 



provided we started from K sufficiently large, and we may again set Tk = 0. 

Case 2, ir(k) e (vi)i. We recall that d m (resp. i? n ) are constant on all the to which 
have the same affine space A = [vi;pi]e associated to its £-cut. Moreover we know that 
n has an £-cut with associated affine space B = A + n — m — [w£_qi]t and f2 m , £1„ have 
Toplitz approximations Q defined by fl mA1 fl mA +h- By Corollary 95 (we set h = n — to): 

(W7) \n m -n mA \,\n n -n mA+h \<2\\n\\]:N- AdT . 

We define 

2)*,m,n := (w, k) + \ir(k)\ 2 - 2(ir(k), r(m)) + 2(tf m - tf„) + n mx - O mA+n _ m , 

and notice that 2) depends only on h, the subspace A — [vi\pi]z and on k, since all m which 
have the same (£,t) cut [ujjpi]^ have the same type u — to — r(m), then (ir(k), r(m)) = 
(7t(/c),to) — (Tr(k),u) and, since by hypothesis on /c we know that (i:(k),x) is constant on 
x G A Then h = m — n fixes the type of n. We thus denote T>k, m ,n = ®fc,h,A and by 
construction (recall that n — to + h): 

(148) Dk,m,n = ^>k,h,A + ~ ^m A — + ^m A+h - 

Finally, since 

p(2) 

is quasi-Toplitz, we may set 

where "P = "P^ 2 -* e T p is a piecewise-Toplitz approximation of Pk,o, 
p(2) = N idT(p{-2) _ pl2)) we have the bounds ||-p( 2 )|| r!S , ||p(2)| 



e > can be taken arbitrarily small (see Lemma 8.18). 



< \\P {2) \\} 



so that for 
e where 



We notice that by \1A7\ 

N AdT m h 



'k,h,A — Afc,m,n| = N T \Q m ~ £l mA — + ^m A +h\ < 4||fi||^. 

If the denominators are bounded away from zero then (cf. (145)): 

P^> AT^dr l 

Fk, m ,n = N T (F k ^ n ^ m — T k (h,A)) 



+ ri 2) (h,A) 



D k 



N AdT (D k , K 



'k,m.n 

is bounded. 

Summing over the indexes k, to, n such that to has a cut with parameters (N, 9, /i), we 
obtain 

. IIP (2) IU M\n\\U\v {2) \\s,r 



< 
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This we may rephrase as 
(149) \\F^§<\\P (2) §( 



4||0||£ 



inffc 



h"lffe 5 n.m |^/e,m,n^ 



fc,m,n | 



Then < 7 by hypothesis, (A3*), Smallness condition, formula (118). Our final 

task is to estimate the two inf. We already know that \Df. m ^ n \ > jK~ 2dTl and need to 
estimate 1>k,m,n- In order to conclude we need a technical Lemma: 

Lemma 10.16. For all N > K , k £ Z b with \k\ < K and for all m, n such that: 

1) |r(m)|, |r(n)| > 9N T , 2) r(m) — r(n) = — 7r(fc), 3) m,n have a cut at £ with pa- 
rameters p = (N, /i, 9, t) with associated affine spaces A, B = A + n — m and finally 4) 
7r(fe) £ (vi)i, one has 

\®k,m,n\>lK- T ^ T0 . 

Proof. Since 7r(fc) S (vi)i then the value of (7r(fc),r(m)) is fixed for all m € [vi',Pi]e with 
the same cut. By Proposition |7.12| with N = K we know that relative to the parameter 

K < N, m^lv'f, p'j\ is such that m € [v'^p'j^ for some £, in particular p'^ < cK^a . 

We now have to distinguish two cases, N > (cC^ 1 y^K Tl / T ° or N < (cCT 1 ) -5 ^ K Tl / T °. 



Case 1 N > (cCT 1 ) ^K Tl / T °. We use Formulas fll47| , ([T48|, ([Tl8| and $142^ 

qn\\%N~ idT 



k,m,n 
142 



4|i^||^iV- 4dro 



> 



\{u,k) + n r , 

7 min(i4T~ 

1 [iam(K- 2dTo 1 c 2d p'f 2d ) - (cC-^^K^'K- 4 ^ 1 } 
j[R- Tl/2 - (cC- 1 ) M if MT °- 4 ' ,Tl ] > -fK- 2dTl , 



— 2dr Q c 2d p r_-2d^ 



since #2*ri-n/2 - (cCT 1 ) 2 '* K 2dT «- 2dTl > 1. 

Case 2 If on the other hand we have N < (cC _1 ) ^ K Tl ^ T ° we proceed as follows. 
We consider the space A = [vi\Pi]i £ Hn (and the associated B = A + h, with h = rn — n). 
By hypothesis p e < [iN T < CN Tl / id so A is one of the spaces considered in point iv) of 
Definition 10.8 then by definition 1)k,h,A 
h of point iv) of Definition 



10. 



Dk,m A ,m A +h and vtla +h is one of the points 
Thus by the Definition of the set + we have 



|2>*,fc, 



\D 



k,m,A ,rriA-\-h \ 



2j mm(N- 2dr " , c 2d pj 2d ) > jK~^ 



since by hypothesis p e < (j,N T < CN T < C(cC" 1 ) _5 ^if T i / 4dT ° and K > N a so by ((49]) we 
have cC^K > 1 

□ 



We now conclude the proof of Proposition 10.15 We estimate the denominators in 



(149) by Lemma 10.16 above. Using the definition (49) of tq,t\ we obtain the desired 
bound. 

□ 
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10.17. The new Hamiltonian H + . Recall we have set p = (s,r, K,9, fi, X,0), p+ = 
(s+, r + , K, 6 + , A + , + ). By Propositions 9.2 and 10.15 F defines a M-analytic sym- 
plectic quasi- Toplitz change of variables from D(s + ,r + ) to D(s,r), where r + ,s + are de- 
termined by ( 131 ). 



The change of variables is of the form <f> = I + ^ with the bounds (cfr. ( 100)): 
(150) < 2\\X F § 

and for any function / e T p we have that e ad ^f S Tp + where p+ satisfy (99). 



We now analyze H+ := e ad ( F \H), recall that by definition ad(F)(N) = -P|^, + [P^ 2 ]. 



(151) 



H+ := e ad ( p \N + P)=H + ad(F)H + V ad ^X(H) = 



i>2 



: N + P - Pg 2 K + [P- 2 ] + ad(F)P + 



3- 

ad{Fy 



(H) 



i>2 



We call M + [P- 2 ] := N + and the rest of the Hamiltonian P + , so that 



(152) P + := (P - Pf K ) + {F, P} + 



ad(Fy 



J>2 



J>2 



By formula ( 126 1, 
(153) 



w+ := w+Pq 1 ' , fl+ := Q n +P u 
We need to 

• Prove that H + satisfies conditions (Al) — (A4). 

• Estimate all the new parameters p+. 

10.17.1. Lipschitz estimates M + , L+,\ + . We have 

\uj + \ h " + \n + fp = \u> + P^°\ Hp + ™p |o„ + i#? + , B J Kp < 



3 0,2 
l,n,+,n,- 



+ _ 



CT(n)\r(n)\'+2$ n +n+ = 17 n +P u 



0.2 



■ SUP |fi n |«* + \P^f P + SUp |P °; n 2 + ,n,- f P < M + |Po'°| KP + SUp |P ° 



,2 

71,+, n,- 



by definition of M. 
In the same way 

a;" 1 = (w + Po 1,0 ) -1 = (w o (Id + cj- 1 o Pq' ))" 1 = (Id + w" 1 o Pq 1 ' )" 1 o w -1 , 

so that w + is invertible as a Lipschitz function provided that L|P ' | hp < 1 with the 
bound 

(154) \oJ+ 1 \ Hp < L 



i-lIPo 1 ' ! 1 *' 



We recall that || • || A = || • || + A|| • \\ Hp , hence 

\Po'°\ Hp , \Poi + ,n,-\ Hp < A-'llP^Ilt < M|e1 
since A = r yM^ 1 . We define 

(155) M+ := M(l + 2|e|) , L + := i(l — ML|e]) _1 , 

notice that L + is well defined since by (A3*) LM|e| < 1. By construction 



\uj+\ Up + \^+\ Up < M+ 



>l l \ Kp <L+, 
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finally wc have: 

|w+ - v| < \u - v| + |Po'°| < Me 2 + j\e\ < M+e 2 , 
since 7 < 2Ms 2 . We finally set A + := j(M + )^ 1 we have A + < A since M + > M. 
10.17.2. Toplitz estimates e+,0+. We wish to show P+ is quasi-Toplitz and bound 



-+ 



for h = 0,1,2; 6+ := 7 _1 ||^p + ||a 



T 



We have that (cf. ([152}) P+ = (P - P|£) +A + B where 
i! 



J>2 



[P|£]), P = {F,P} + £ 
i>2 



aci(Fp 



P, 



We argue as in Proposition 5 of [32] or in Proposition 



9.2 



By Formula ([143} < 



K 2ti / r °|e|, the hypothesis (131) implies that wc have the conditions of (981, that is 
12 2 2 "+ 9 e£ -x ||.X>|||' < 1, where S = mm(l - 1 - Therefore we have that 



(156) 

We obtain: 
(157) 



i*a||£ + <25- 1 \\x f §\\x p ^§, 



\X b § + <2S- 1 \\X F \\Z\\X P \\Z 



\x P+ § + < \\X P § 



■iKeS-HXpBWXpWl 



Using (1431 we rewrite (|156| and (|157|) as 
(158) 



(159) 



7 - 1 ||^||I + <r 1 K| £ | 2 , 7- 1 ||^ll? + <<5- 1 K| £ |6, 

e+-e<$ -1 K|e|e. 



Let us now compute the terms of order < 2 in P + , we have: 



P 



00 



P 



^ + (A + {F, P> 2 } + {F, P^ 2 } + £ 

ad(F) J ' 



ad(Fp 



P, we have the bounds: 

T ^ ox— 111 v \\T 11 v 



Again from (1431, denoting E = J2j>2 

\\X {F ^ P < 2} § + <2S- 1 \\X F § ||X P < 2 §<rt 7 |ef , 

< 5- 2 (||x F ||£) 2 ||Xp||J < rt 2 | £ f e. 

The contributions from {F, P >2 } are 

n {P,P >2 } = 0, IIi{F,P >2 } = {F {0 \P^}, 

n 2 {p p >2 } = {p°, pW} + {pw, p( 3 )} , 

so applying the Cauchy estimates we have, setting 3 := const. (<5 _1 K) 2 : 

eV 0) < m 2 (l + e) + 2e^e-^ s ^ K 

e { + ] <3(9e (0) + |ef (1 + 9)) + 2e« e -( s - s +)* 

ef < 3(6(eW + e«) + |ef (1 + 6)) + 2 e^e^ s - s ^ K . 

Note that the terms 2e (7i )e~( s ~ s +* |K come from P># via the smoothing estimates 

We write in matrix form, denoting by e the three dimensional column vector of coor- 
dinates (e (0) , e (1) , e (2) ) and 1 := (1, 1, 1) we have 

e+ < 3(eLe*+|el 2 (l + 6)l) +2e-( s - s +) /f e 
e+ < e + 36|el 



91 



(160) 
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where the matrix L is 

/0 0\ /0 0\ 

(161) L = 1 L 2 = , L 3 = 

\1 10/ \2 0/ 

and the vector inequality means that the inequality is true for all three coordinates. 

Now we need to be able to handle in a recursive way the inequalities obtained so far, 
this we shall do by a formal inequality in the next paragraph which is a variation of 
Lemma 5.8 of [2]. 

10.17.3. A useful inequality. We fix \ such that 

(162) 1< X <23. 
and o, b, c three positive numbers satisfying: 

(163) a, b, c> 1 , 12b 2 < min 

and set 



( 164 ) t: = m A?(^7^> ) >0 - 



e * R (2-x")x'~ 
TeN v 2b(2c) 4 ' 32b 3 c 3 

Lemma 10.18. For j G N consider a sequence (ej,Qj) with 6j := (e ? , ef , e,- ) a vector 



id Qj a number, all with positive components. Set \ej\ := + + ej 2 ' . 
Suppose that for ( L as in ( 161 ) j we have: 



(165) 



3+i < bc^(e,L e ; + |e;-| 2 (i + e J )i) + e- 
e,+i < <->, ■ b^e <; . 



There exists Cq :— Co(c, Xi O, b) > 1 such that for all 9 > satisfying 

(166) 29C < min(l/3,i) 
we have that 

(167) l/3|e* |,e < 6 =4> |e}| < C |e |e-^ , 9, < 9 (1 + C ]T 2^) , V j > 0. 

0<Kj 



Let io be the value of i for which the minimum i in (164) is achieved. One easily sees 
that, since |eo|, Oo < 3© < 1 we can find a value Co (depending only on a, b, c, x) Ior which 
both relations (167) hold for all i < io and |eo| < 1- 

We now work by induction and suppose that both relations hold up to some i > io- 
Then 9; < 9 (1 + C J2i<i<i < 2Q oC and, assuming 29 C < 29C < i, we have 
be2 l+1 e- x% < r 1 < (290C0)" 1 estimate : 

9 l+ i<9o(l + C ^2- i + bc l 29 A 2 e -^)<e (i + Co £ 2-'). 

l<i l<2<i+l 



Notice that the constraint (166) and the inequality (167) imply 9j < 1 for all j. We now 
substitute 9j < 1 for all j < i in the first relation and get 

e i+1 < be' (14 + 2|e,| 2 l_) + e-^e, 
by recursion we obtain a bound for e* i+1 in terms of and e;_2- 
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We now assume by induction that the bounds in (167) are satisfied for all j < i and 
then: 

el+i < bh 21 - 1 ^ 2 ^ + 2|e*_ 1 | 2 Ll) + bcV^L^! + 2bc l |el| 2 l + e~ aT e t < 
e i+ i < b 2 c 2i_1 L 2 ei_i +2b 2 c 2i - 1 |e l _i| 2 Ll + bc^- 02 '" 1 ^-! + 2bc l \ei\ 2 l + e~ a2 ' e> < 



2b 3 c 3l - 3 |e J _ 2 | 2 L 2 l + b 2 c 2l - 1 e- a21 



L 2 el_ 2 + 2b 2 c 2i - 1 |el„i| 2 Ll 



-bc'e-" 2 ' 1 Le l - 1 +2bc l \e l \ 2 l + e- ar e l . 



This in turn implies, since |L|=3,|L 2 |=2, that 



+2b 2 c 2! - 1 e- fl21 



|el_ 2 |+3bc l e 

i-2 



< |e | 2 C 2 (4b 3 c 3i - 3 e- 2 * + GbV^e" 2 * + SbcV 2 * )+ 



|e |C (2b^c 



2 J2i~\ -aT 



-ibCe-^ ~ x +e- a2 '-* 1 ) < 



| £ ' |C [16| e ' |C b 3 c 3l - 3 e- 2 ^^ +6b 2 c 2l - 1 e- a2 ^ 2 -^" 2 ] < |e lC e-^ 
This is achieved provided that, 

[166C b 3 c 3i - 3 e- 2 * l ~ 2 +6b 2 c 2l ~ 1 e~ a2l ~ 2 -^ 2 } < e^ +1 

and this in turn is valid if we assume the constraint (163). 

We will apply this to 



(168) 



C := 4 1 + 4 ^ 2 /^o {, . = CQst K ir^/r a ^ fl . = 



32 



We note that, with this choice of parameters, condition (163) amounts to a largeness 
condition on K . 

10.19. Iteration. Let H° = W° + P° : D x O -t C be as in Theorem^ Define 



(169) 



\\X p (h) 1 1 o 



< e := 



\ x P \\o 



7 7 

Since for any p > we have lim^oo 2 pu e~ x = we have e _1 < max„ e~ x 2^( 1+4ti / t °) < 
oo and we define: 

(170) C* = C*4X,K,d,n,Ti,T ,so) = 2 2n+9 eC Af a,7 lK - ft:2Tl2/ro maxe-*^^ 1 ^ 2 /^ , 



here Co is the constant of Lemma 10.18 with the choice of parameters (168), note that 
it depends only on K, k, d, n, r%, tq, sq, MqQq . Recall that, as we have stated in Remark 



10.3 



Moa > 1 is an e independent constant of the problem. 



We now fix 8 := 0(K, k, d, n, ri, To, sq, ao) in order to ensure the smallness conditions: 
(171) C*6<(12e)-\ JJ(l + C*e e - x " _1 ) < y/2. 



together with the condition (166) of Lemma 10.18 



We now need to estimate all the parameters in the iteration. For the parameters which 
increase we exhibit bounds from above and for the ones which decrease bounds from 
below. Thus for v € N we define 



S v := 2 



o— v— 3 



y+i := (1 - 5 v )r„ , s v+ i := (1 - S v )s„ , D v := D(s Vl r v ) , 



M u := M„_i(l + Creole-*"" 1 ) < v/2M , K ■= fr, 
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• L v := L„_i(l - aie-ole-x"- 1 )-! < 

• K V :=4?K,, 6 v = 6o(l + Ej<v2- j ), ^ = Mo(l - Ej<„ , 
. a^-aoCl-aieolE^S-i), 

• 3„ = <5- 2 const.(2e^ Tl2/ro ) 2 = be". 



We have made our definitions so that min(l — , 1 — ) = <5„. Note that r v+ i \ 

^n"= (i-^) > \ So nr= ( 1 -^) > ' s -i,e v s yze /2<c = 2o , ^\ 

2^o/3 > c = /i /2. 



For compactness of notation we will denote 
(!72) || • ||j := || • lift , p. := {r^a^Oj.K^S^B^n^aj.M^L^c.C), 



where Oj is defined in the course of the proof of Lemma 10.20 and Sq which remains 



constant during the iteration is fixed by setting Sq = S^/uMqLq. 



Lemma 10.20. (Iterative Lemma) Let Co, C*, 6 be fixed as in formulas (166 1, (170) 
and dm]). Let T be as in Formula (pp) and B = 2TK- To+n+d / 2 . If for the Hamiltonian 



H we can choose 7 so that if for O defined in (169) we have: 
(173) 9 < 6 , B-fao 1 < As 2 

are satisfied, then we can construct recursively sets Oj C O and a Hamiltonian Hj 
■Nj + Pj ■ Dj x Oj 



C so that, if we define 



(174) 



Ah) 



\X 



P W \\j 



7 



(0) (1) (2) 



the following properties are satisfied for all j : 



(Sl)j For j > 0, Oj C Oj-i is defined by (fl32|) - (fl36|) with w = ujj-i and fi„ = 1] . 



We have that Hj = Hj_i o $7 where $j 
analytic symplectic maps of the form $j 



Dj x Oj 



Dj_i is a Lipschitz family of real 



*j with ll^ll^. < C^62-J. 



(S2)j The Hamiltonian Hj is compatible with the parameters p . (Definition 10.1). The 



parameters r + 
set O 



rj'+i, s + = Sj+i satisfy the hypothese s (|131 ) 0/ £/ie /fj4M ,s£ep and i/ie 
j+i C Oj satisfies Formula (130), namely, using ( |49[ ) : 

\Oj \ O j+1 \ < T 7 a^e 2( - n -^K- T " +n+d/2 = 



|Oo\O i+ i|<B 7 ao 1 e 2 ( n - 1 \ 



(S3)j There exist Lipschitz extensions ljj, flj of ujj, flj defined on Oq and, for j > 1; 



(175) |wj - cbj^\ + \j\&j - ^_i| lip < jef , ||0^ - fVilloo + A.11% - Qj-xW^ < je) 



.(2) 



lip 



(176) 



I lip 



v| < 
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(S4)j ej,0j satisfy (165) and (167) hence \cj\ < Co|e*o|e 



(S5)j For j > the sequence of composed maps $j := $i o <J? 2 ° • • • ° $j = I + 
satisfies ||* tf+1 _ ^||^ < a|e* |2-, . < 2C*|e |. 

Proof. We proceed by induction the conditions (Si)o are satisfied by the hypotheses of 
Theorem [5] and the constraints on Co, C*, 0. Then, by induction, we prove the statements 
(Si)„_|_i, i = 1, . . . , 5, by assuming the validity of (Si)j for j < v. 



(Sl)„ +1 . We apply the KAM step with H = H v . By (S2)„, we have that H v is 
compatible with the parameters p v . In order to implement the KAM step, and deduce 
(Sl)j,+i we need to verify the constraints of Formulas ( 96 ) , ( 99 1 and (1311 are satisfied for 
r + ,s +1 K + ,0 + ,n + = r v+ i,s v+ x,K v+ i,6 v+ i,iJL v+ \,K = K v ,e = e v . Substituting in ([96]), 



(99 1 we easily see that this amounts to a lower bound on K, depending only on tq, t\ and 



the remaining parameters in p . This we have imposed at the beginning of the algorithm, 
as we explained in Remark |10.4| 



As for (131), we have by induction the inequality on \e u \ and we have to verify 



, - _ „ tlx. 1 1 1 

ko v eC o 0e x K v T o < A , which is contained in the constraints (171). 

Then following the KAM step we construct the set 0+ which coincides by definition 

with O v +\. On 0„+i, we define the generating function F = F v+ \. Then we construct 



the real analytic symplectic map $,,+1 : D u+ i x 0„+i - 
by F. We have: 

H v+l := H+ = H u o $„ +1 =; + P v+1 



D v , Lipschitz in 0„+i, generated 



and P„+i := P+ defined in ( 151 ). 



(S2) y +i. By construction H u+ \ is of the form given by Formula (10.11. We want to 



apply the results of \ 10.17 in order to prove that, V£ € 0„+i, the Hamiltonian H u+ i, is 
compatible with the parameters p v +\- For this it is enough to show that the constraints 
found on p + in that section are in this case valid for the parameters p v +\- First we need 
to verify ( 99 ) which is a largeness condition on K: 
(177) 



e " ^»k)*4 v K Ti <1, k< ^2 6v K A \n{A v K)- A , kC < 9 2 



-)3i/+1^4rfr ( 



hx{A v K)~ 



From Formula ( 171 1 we can bound uniformly M v < V2Mo, L v < ^/2Lq so that we have 
for all v that So = 8y/nM L Q > Ay/nM v L v . By exploiting (155) and |e„| < C |eo|e _x ' y , 
since 2Co, 2Mo£oCo < C*, we verify that M + < M v+ i and L + < L v +i: 

M+ := M V {1 + 2|e„|) < M v (l + 2C |e |e" xl ') < M v+X 

L+ := L V {1 - M v L u C \e a \e- x " y 1 < L v+1 . 



Finally in order to prove that u> v +i is a lipeomorphism we argue as for (154) since 



L v+1 \Pl> {> \ Hp < \/2£oCo|e"*o|e- 



U < 1. 



The estimate on 0„+i follows from Lemma (10.10). We finally have to estimate a„+ 



as in Formula (119). Denote by P. 



3 0,2 

0,*,+,* 



the vector with coordinates P, 



,0,2 

0,n, + ,n, 



By 



Formula ([1531 

(178) |A^«« + (O,fc> + (n+(O,0)l 

> \AUH0,k) + (0(0,0)1 - \AU(Po°(0,k) + (<*,+,*,-> 0)1 
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so it is enough to show that a v — |((P ' (£)) + (Po.'* 2 +.*.-7 0)I' <P — ffl n-i- By construction 
P = P V and |fc| < So thus, since for any P we have by definition \P\ h p < A -1 ||P|| A 

Jl70l 

< 4S o7 A; 1 |e-;| V 472S M Co|eo|e- x < a C*\e \2-'- 1 = a v - a v+1 . 



(S3) y +i. The frequency maps 0J v +i, are defined on O v +\ and, as we have dis- 

cussed in the previous item, have Lipschitz scminorm bounded by M v+ i. Then we may 



apply Formula (153) to deduce the bound (175) (recall that eif 1 = 7 1 ||P 



>(2)| 



for w„ 



and By the Kirszbraun theorem (see e.g. |12j). used component-wise, and 

can be extended to maps- which we denote by ui v +\, defined on the whole 



Oq and preserving the same sup-norm and Lipschitz seminorms of (??),( 176 1 follows. 
Moreover this extension may be performed so that ov+i 



p(i,o) where p(l,0) 

is an 



extension of P^ 1 ' ) which preserves the A-norm (same for 0„ + i); this verifies (175) 



(S4) y +i e u +i = e+ satisfies (160), with C = an d { s ~ s +)K/2 = a v K v . Recalling the 
definition of b, a, c we have that <f„+i, Q v +i satisfy the inequality of ( 165 ). We are in the 
Hypotheses of Lemma 10.18 so that also the bounds ( 167| holds. 



(54) v+1 follows by ( [173] ), (S3),, and Lemma [1018 

(55) y +i. Let us denote by H*'® the normed space of functions in H Syr which depend 
in a Lipschitz way from parameters in O with finite ||.|| A norm. The estimate of the 
norm of the map 4^+1 : H$<° -» H* v '® u follows, using (|143|), ||X F J|„ < K v \e v \ with 



= 2enK v 



2eK(4 u K) 2 ^ / To and hence from ([170} 



I^fJU < 2eK(4 ly if) 



" I^^2ti 2 /t 



C |e*o|e" x <CJe |2- 



-j/-2n-7 



From (1711 and (98) 



122 2n+6 e6- 1 \\X F \L 



< g => ll^+ill < 



1 1 s v ,r u 



Now we can estimate 

1 + = n^ 1 + ii 1 + ^ + n*iii) ^ ri( i + 2 ~ i ) ^ 2 



(179) 



|*„+i-#„|| < C*\e \2- V . 



Notice that ^ v also maps T v to T v and we have similar estimates using the Toplitz 
norms. □ 

Corollary 10.21. For all £ e 0^ := Ci v >oO v the sequence = I + ^ v converges 
uniformly on D(sq/2, Tq/2) to an analytic symplectic map $ = I+ty such that the essential 
part of the perturbation P2^(-,£) = 0. Moreover we have 



|Co\Coo| < costje 



2(d-l) 
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Proof. The fact that the give a Cauchy sequence follows from Formula ( 179 1, therefore 
the sequence <!>„ converges as a sequence of Poisson bracket preserving homomorphisms 
from H s .r to H. Sx . roa to a Poisson bracket preserving homomorphism $00. The fact that 
this is induced by a coordinate change follows from the fact that we can construct the 
local inverse, lim $>' v where $' v = <&' v o o ■ ■ ■ o $j and & v is the flux at time —1. Finally 



P||(-,£) = 0, V£ <= Oco, follows by (174) and (S4)„. □ 

We can finally conclude that 
Theorem 6. If for the Hamiltonian Hq we have on the domains D(s,r) x A e 2 a uniform 



estimate for the perturbation as in (28), i.e. \\Xp\K < C[er + s 5 r 1 ) then provided we 



take r, e sufficiently small the conditions on 7 of the the iterative Lemma can be satisfied. 

Proof. We take r of the order of e 2 so we estimate \\Xp\\ < Ce 3 and then it is clear that 
for e small the constraints on 7 can be both satisfied. □ 

Part 4. The NLS 

In this final part we prove that the NLS is a compatible Hamiltonian ( in suitable coor- 



dinates) according to Definition 10.1 and therefore we can apply to it the KAM algorithm 
and arrive at the conclusions of Theorem [3| Most of our work will be in showing the 
Toplitz property of the normal form of the NLS. 

11. The Toplitz property of the NLS 
We start by noting that for all d > 

(180) A d := ^2 u kl u k2 u k3 u kl ...u ko _ d _ 1 u k2d = \\\ d ^\u a vP 



fciSZ": V(-1) 4 *<=0 o,3e(Z")N : 

\d\=\fi\=d 



a 



is quasi Toplitz. 

In fact we see that the coefficient of u a m u a n in Ad depends only upon <r, a' and am + o-'n. 



We fix c,C as in (49) 



11.1. Semi normal form. We now analyze the Birkhoff normal form change of variables 
defined in ^ with the purpose of proving that it maintains the quasi-Toplitz property. 
Note that the initial variables u,u £ @s=<b = x ^°' P ' ^ e definitions of quasi-Toplitz 
functions of Part. 2 hold also with 5 = and hence n = (i.e. there are no action-angle 
variables x, y). In this case the graph Tg is trivial and the root map is the identity. 

Proposition 11.2. There exist two constants c, C > such that, for all choices of pa- 
rameters < R < £o an d (K,9,fi), the quartic term A2 in the NLS Hamiltonian Q and 
the generating function Fp ir k defined in (Jsj) are quasi-Toplitz with \Xp Birk \J R / 2 k e «) — 



c\A2\T R K g ^\ < cR 2 . Then for all (K',6',fjf) which respect (99) with (K,9,n) we have 
that o H = H Blrk + + P (6 *> with \X P (,)\f R/4Kg ^ < CR l ~ 2 , i = 4, 6 and 

\ X HBirk-^\jR/4,,K t d,fj,) - CR2 - 

Proof. Since A2 is Toplitz the fact that it is quasi-Toplitz is trivial, the bound \M\J R K e ^ _ 

cR 2 follows by the degree estimates ( |27[ ). Given m, n such that m^[vi;pi\ and m, n have 
the cut I with parameters 9, fi, t, we collect in Fsirk the terms which are N, 9, /i-bilinear 
with cut r on to, n and (a, o~') = (±, ±). We denote each summand by P„ „ ■ By symmetry 
we may consider just +, + and +, — . 



A KAM ALGORITHM FOR THE NLS 



57 



Set c Qj( g := ( Q ) (o) for simplicity of notation. In the case +, + we write a = ao + 
e m + e„,/3 = /3q, in the case +, — we write a — a Q + e m , P = /3q + e n . By hypothesis 
\m\, \n\ > cN Tl hence they are in S c and | ck 2 ] + i ^2 1 > 2. Then the condition | ck 2 | -h | /3 2 j < 2 
of implies that the support of ao, fio is in S. We have 

p+.± _ _■ u u 

m,n ' 1 2^ C ^V._( a 0_ / 30)|j|2 + | TO |2 ± | n |2- 

(the symbol ~J2* summarizes the conditions of ([8])). 

In the case +, + we claim that the denominator is big so that we can choose J r+,+ = 0. 

Indeed we have |m| 2 + |n| 2 > 2cN Tl while | EjeS^j ~ ^j)!^ 2 ! < 2 ^ 2 where k := 
su Pjes Ijl- 

Notice that in any case n — m = Tr(a°, /3 ) := X^<Es( a j — Pj )j 
In the case +, — we write 

|m| 2 - |n| 2 = (m - n,m + n) = (m - n. 2m + n- m) = -2(7r(a , /3°), m) - |7r(a°, /3°)| 2 . 

We have to distinguish two terms in the sum, that is the ones in which 7r(a°,/3°) := 
~ Pj )j ^ anc i the other case. 

* w Q u^ 



'E j6 s(«S - Z 3 ?)!:! 2 - 2(7rK,/30),m) - |7r(a°,/J°)f 



In the first case, since m has a cut at £, we have \(v, m)\ > 9N 4dT for all v ^ hence 
the denominator is big and the norm of the first summand is bounded by cN~ idT \A2\J . 

In the second case the denominator depends only upon a°,/3° and on the affine space 
[vi',Pi]e associated to the cut. Also the constraints in the sum ([sj) depend only on a°,/3° 
and on the space [vi]pi]i hence the second summand is in Tn,9,^- The bounds follow by re- 
calling that the denominators are non-zero integers. Then the bounds on the transformed 
Hamiltonian follow by degree considerations. □ 

We fix 

(181) = C(l-~), 0' = C(l-~), // = c(l + ~), M ' = c(l + ~) 



so that ([99} holds for all K = K' > N . 

11.3. Action angle variables. The results we need are mostly contained in [2], although 
there are some small notational differences and the results in that paper are stated for Z 
instead of Z d , but the proofs follow verbatim in our case. 

We introduce action-angle variables on the tangential sites S := {ji, . . . , j„} via the 
family of analytic and symplectic maps 

(182) $(x,y,z,z;£) :={u,u) 

defined by 

(183) 

u ii '■= V^i+Vi elx ' > u ii '■= VZi+Vi e ~ lx ' 1 1 = !> • • Uj ;=Zj, Uj := Zj , j e Z d \ S . 
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Let for p > 

(184) A p :={eeM" : | < < p, Z = l,...,n}. 

Lemma 11.4. (Domains) Le£ s,r, e, i? > satisfy 

(185) 4r < e , i? = C*e with C* := 4^K p e (s+aK) . 
Then, for all £ € A e 2 U A 2e 2 , the map 

(186) $( • ; : £>(*, 2r) -> 2>(fl/2) := B fl/2 x B fl/2 c x 

is we/Z defined and analytic (recall that D(s,2r) is defined in (12) and k := supj gS |j|). 
For the proof see [2] Lemma 7.5. 

Given a function F : T>(R/2) — > C, the previous Lemma shows that the composite 
map F o $ : D(s,2r) — > C. The main result of this section is Proposition 11.5 if F 
is quasi-Toplitz in the variables (u, u) then the composite F o $ is quasi-Toplitz in the 
variables (x,y,z,z) (see Definition |8.13 1. 

We write 

(187) F^i^m^, ra aJ := (^V^V^)^^ , 

where (5 C = Z d \ 5) 

u = (it^ 1 -*, u^) , it*- 1 -* := {iij}j£s , := {uj}j e s c > similarly for u, 

and 
(188) 

(a,/?) = (a«W 2 \/3«+/^), (a (1) ,/? (1) ) := &} ie9 , (« (2) ,/3 (2) ) := {a,-, ■ 
We dchnc 

(189) H%:={FeH R : F= £ i^/V} . 

Q (2)_|_£(2)> D 

Proposition 11.5. (Quasi Toplitz) Let p = (r,s,K,6,fi,X,A e 2), with K,6,p,,p' be 
admissible parameters and 



(190) 



Q T R/2,K,e,»> n n R/2' then f~Fa<f>eQT and 



(191) 



l/IIJ < (&r/R) 



D-2 ^ II r?\\T 

2 II llfl/2,if,0,/i' 



Corollary 11. 6. F or all e > 0, e/4 > r > e 3 and s > satisfying ( |185 |, i/ie perturbation 
P of Definition 2.1 is in for the parameters p — (s,r,8 = C(l— |),/i = c(l + |), A, A e 2). 
Moreover P satisfies the bounds 



(192) 



where C does not depend on s,r. 

Proof. We choose pi = c(l + |), the perturbation P has contributions from three terms: 
1) The term P^ o $, 2) The term p( 6 ) o $ and finally 3) the terms of degree > 2 in 
(Hsirk — K) o By proposition 11.2 with the choice of parameters (181) all the terms 
above are quasi-Toplitz with the parameter p! and the Bounds of Proposition 11.2 hold. 



A KAM ALGORITHM FOR THE NLS 



on 



In item 1) we note that, by definition, P^ € % R so, by Propositions (11.5) and 
(11.2), we have xll^P< 4 'IIJ — { r /Ft)R 2 < Cer. In item 2) we recall that by momentum 
conservation the first term of P^ of degree D = ( P^ G W^Wg) is actually of degree 
at least 8. Then we divide P^ = R+Q where Q g T-L R and R is of degree at least 8 in u, u. 
By Propositions pXE] and |iT2| ), we have < (r/R)~ 2 R 6 < Ce 8 r- 2 < Ce 5 ^ 1 . 

In the same way ||X Q ||? < (r/R)- 1 R 4 < Ce 5 ^ 1 . 

Finally in 3) we collect the terms of degree 3 and 4 in formula ^ , we get the estimates 
T\Xu>* { H Birh -w.)\ T r <er. □ 

The rest of this section is devoted to the proof of Proposition |11.5| Introducing the 
action-angle variables ( |183[ ) in (187), and using the Taylor expansion 

f g\ = g(g-l)...(g-h + l) 



(193) (l + t) 9 = J2\ 9 h ) th 



h>0 



:= 1 . 



h\ 



h>l. 



we get 
(194) 



with Taylor-Fourier coefficients 
(195) 



/fc,»,a< 2 >,/3( 2 > : — J! ^a^II^ 
Q<i)-/3(i)=fc 1=1 



n „ (D + „{i) 



^ (1> +ft (1) 
2 

'7 



Lemma 11.7. (M-regularity) If F e U R / 2 then f := F o $ e W s . 2r and 
(196) 

H*/lk2^/4,2 £ 2 ]" < (Sr/i?)^ 2 !!^!!^ , < e- 2 (8r/R) D - 2 \\X F \\ R/2 . 

Moreover if F preserves momentum then so does Fo$. 

Proof. See [3] Lemma 7.7. □ 

Definition 11.8. For a monomial m a ,p := (uW) a<1) (S^)^ 1 ' (u( 2 )) a(2) (u^)' 3 ' 2 ' (as in 
( [T87| >) we set 

(197) p(m Q ^) := ^(j,)^' + , (?) := max{l, . 



i -n 



For any i* 1 as in ( 187), K £ N, we define the projection 

(198) n p > K F:= F - 

P(w.<»,p)>if 
Lemma 11.9. Let F € Hr/2- Then 

(199) ||^"(n p >,K-.F)o4||s,r,A c 2 < 2 _ 3^ + 1 ||X Fo $|| S! 2r,A 2e2 ■ 

Proof. See [3] Lemma 7.8. 



□ 



Let K, 6, /x, //, r be as in Proposition 11.5 For N > K and F £ Hr/2 we se t 
(200) 



/* 



.((F-njv.e^v^o^ 



Note that Hn 9,u' t is the projection on the bilinear functions in the variables u, u, while 
Hn,o,h,t in the variables x,y,z,z. 
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The next Lemma corresponds to Lemma 7.9 of [3]. 
Lemma 11.10. We have 

(201) \\Xr\\;r,A o < 2-^ +1 \\X 



Fo$||s,2r,A„ 2 



Proof. We first claim that if F = m a p is a monomial as in (1871 with p(m Q p) < N then 
/* = 0. 

Case 1: m a ,p is (N, 9,/j,', r)-bilinear, see Definition 8.2 Then ,r^a,p = ^a.p 

and /* = 0, see p00|. 

Case 2: m a ,p is not (N, 9, //, r)-bilinear. Then Tl^^^i iT m a ,p — and /* = HN,0,p,T{^a,fi° 
$), see (200). We claim that m a ^o<l> is not (N, 9, fi, T)-bilinear, and so /* = L T jv i 6» iMiT (m Q ^o 
$) = 0. Indeed, 

,„„„, _ ,^ , i((a (1) -S (1) ) x) a (2) -3 <2) 

(202) m ai/ 3 o $ = (£ + y) 2 e'U" p J^ z q z p 
is (AT, 0, jit, r)-bilinear if and only if (see Definitions 8.2 and 8.1 1 

Q (2)_ /3 (2) _ & (2)_^2) a a , 

(203) ^ +^ (2) )< M 7V 3 , |m|,|n|>ft/\r\ |a« - /J«| < JV , 

j£Z d \S 

and n, m have a cut at £ with parameters 9, (J.,t. 

We deduce the contradiction that m a ,p = (u«) q(1) (uW)' 3 ' 1 ' (u( 2 ))" <2) (u( 2 ))^ <2> «' is 
(TV, 0, //, r)-bilinear because (recall that we suppose p(m a ^p) < N) 

ElJ«l(«g ) +^ , )+ E + /3f)^^P(m^) + ^ 3 <iV + / ,iV 3t f 1 ^. 

For the general case, we divide F = n p< ArF + H p >nF. By the above claim 



Finally, (201) follows by applying Lemma 11.9 to (jI p >N(Id — Hjy^ 4l i)F) o $ and using 
the fact that projections may only reduce the norm. □ 

Lemma 11.11. Let F <G Tr/2,n,b,ii>,t with ^p>nF = 0. Then F o $(•;£) € T s ,2r,N,e,n' ,r, 

V£ e A £ 2 u A 2e 2 . 

Proof. Recalling Definition |8.7| we have 



F=J2 E ^"'(AomWnX,^ with F°' a ' {A,h) £ £ R/2 (N, n' ,h) . 

AeH N \m\,\n\>6N^i ,a,a'=± 

denoting A = the apex * means the sum restricted to those n, m which have a cut 

at I with parameters {9, /i, r) and m has associated space A. 



Composing with the map $ in ( |183[ ), since m,n <£ S, we get 

<r,o'=± ,\m\,\n\>6N T i 

Each coefficient F a - cr (A, am + a'n) o $ depends on n,m,a,a' only through A, am + 
a'n, a, a' . Hence, in order to conclude that Fo$e Ts,2r,N,8,ii' ,t it remains only to prove 
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that F a,a (A, am + a'n) o $ e £ Si2r (7V, [i 1 , am + a'n), see Definition 
Tn Q ,/3 of F a ' a (A, am + a'n) g £fly 2 (iV, /A crm + a'n) satisfies 



8.1 



Each monomial 



+ + I] + ftOlil < M^ 3 and p(m a ^) < JV 



1=1 



by the hypothesis n p >jvP = 0. Hence m a ^ o $ (see (202)) is (JV, //)-low momentum, in 
particular |«W - < p(m Q!(3 ) < N . □ 



Proof, of Proposition 11.5 Since P g Q^, 2 K g , (see Definition 8.13 1, for all N > K, 
there is a Toplitz approximation F € Tr/2,n,9,h',t °f F, namely 



(204) 



n 



, tT F = P + 7V- 4dT P with ||X p |U /2 , ||X # || fl/2 < 2||P||J A w 



In order to prove that / := Po$ e r k e u we define its candidate Toplitz approximation 



(205) 



/:=ntf,fl, MlT ((n p<w F)o$), 



see (198). Lemma |ll-ll| applied to n p<w P G Pr/2.jv,0.^'.t implies that (II p<A rP) o 
$ G T s ,2r,N,e lf j,',T and the n, ap plying the projection LTjv.g u , r we get / € T s ,2r,N,8,fj,,T C 
Ts,t,n,o,h,t- Moreover, by ( |205 ) and applying Lemma 11.7 to II p< ArP (note that LT p< jvP 
is either zero or it is in 'H^Jwith D > 2 because it is bilinear), we get 



/lls,r,A e 2 — ll^(n p<J vF)o*)lls,r,A e 2 



(206) 

Moreover the Toplitz defect / is 



(8r/R) 
(8r/R) 



D-2\ 



X 



n„ <N FllA./2 



D ~ 2 \\ Pll r 

\\ r \\R/2,K,e,fi', 



N idT (Tl Ni o^ T f - /) ^ N idT lI N ,e, M , T ((F - U P<N F) o $) 

iv*n Wj9 ^ T ((F - f) o $) + 7v 4 ^n JV , e , M!T ((p - n p<7V p) o $) 
n^ e , M , r (P o $) + N^u^e,^ ((p - njv.a.^F) o $) + 7v 4 ^n JV , e , M , T ((n p >^P) o $ 



n 



r (P o $) + jV 4dT /* + ^^^^^((n^jyP) o *) 



Lemmata 



11.9 



and 



11.10 



imply that, since N 4dT 2~^ +1 < 1 ViV > K by ( fl90] ), 

^/l|s,r,A e 2 < ll^"#o*lls,r,A e 2 + N Ut 2 2 » +1 (|| -X> $ || s,2r,A 2e2 + II^Fo* II s,2r,A 2s 2 , 



(207) 

(208) 



{&r/R) u - 2 (\\X p \\ R/ 2 + ||X F || fl/2 + \\X p \\ R/2 ) 



(8r/R) 



D - 2 \\ Pll T 



(to get (207) we also note that we can choose P, P so that they belong to the same H D 
as P. The bound Jl9lj) follows by (pi, ((2061), ((208). 



-R/2 
□ 
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12. Final variables 

For all k £ S c set r(fc) := r(j4) to be the root of the component A of Tg to which k 
belongs (this is chosen once in one of the graphs in the same translation class) . We have 
thus associated to each k an element L(k) £ Z n , see Theorem [I] and formula ( [35] ) : 

(209) z k = e- iL «-*4, y = y'+Y J L(k)K\\ * = ■ 

keS° 

to define a symplectic change of variables D(s,r/2) — > D(s,r) in which the normal form 
has constant coefficients. 

Thus the momentum of z' k is r(fc). We need to see what happens to (N, 8, fx, r)-bilinear 
monomials first. 

Take a monomial 

m = e i{k ' x) y l z a zP 

We have 

*om = e i <*»(i/ + J2 L(k)Wk\ 2 ) l z"*t'P , k' = k-J2L(j)(^-Po)- 

Hence we obtain a sum of monomials 

(210) e^'^O/OVTO^I 23 , h<l, |fc| + |ff| = |I| 
all with momentum: 

(211) n r (k',a,l3) = Tr(fc') + - &)r(j) = n(k', a, p) + ^(a,- - &)(r(j) - j), 

3 3 

As in the previous section we define a cut off parameter 

P(m a , ,k) ■= \k\+2dn{\a\ + \/3\), 

and set 

(212) Tl v > K F := ^2 Fa,^a,l3 i ^lp<K = I - U p > K ■ 

p(m a ,?)>K 
Clearly for all F £ U s . r - Then 

(213) \\X(n p > K F)o<S>\\s,r < 2 2dK+1 ||A fo<I ,|| ;ji 2r • 

Lemma 12.1. Tafce a function F £ tH s ^, assume that 

(// - n)N z > N ,(9- e')N Tl > 2dn 

and 

U P>N F = U^^,F = 0. 

Then we have 

f* ■= u N ,g !lltT (F - n N . g ,^,. T F) o * = o 

Proof. We may assume that F = m a ^ t k is a monomial. If F is bilinear the statement 
is clear. Otherwise /* is a sum a monomials described by formula 210 If one of these 
monomials is bilinear its high variables either come from one of the new exponents g or 
already appear in a,f3. In the first case this is possible only if m is (N, //)-low contrary 
to our hypothesis. In fact suppose that g = g + e m , where to = n is the high variable, 
and that 

3 
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Then since gj > and \j — r(j')| < 2dn we have 

E lil(aj + ft) < M^ 3 + 2d«(|a| + |ft) < //iV 3 . 
j 

Finally since by hypothesis \k\ < N we deduce that m is low. In the other case the two 
high variables already appear in ttia^fc, we claim that this implies tna,p,k (N , 9' , fi' , r)- 
bilinear contrary to the hypothesis. In fact let m, n be the high variables appearing 
in ttia^fe, write a = a + e m ,f3 = j3 + e„, the monomial appearing is tU g+9 p+g,k' z mZn 
and by hypothesis we have that \k'\ < N and ^ ■ |r(j)|(aj + ft + #j) < /iA^ 3 . Then 

We are assuming that p(m ai p t k) '■— \ty + 2d(\a\ + |ft) < N hence < N and we have 
that Tn a ,p,k is (N, 0', /j,', r)-bilineax. 

□ 

We next analyze a function F with II ,F = F and again we may assume that it is a 
monomial F = m aJ ^ k , in this case /* := U Nj g ilJltT (F - U N ^^^ T F) o f = n Wi9 , WiT F o f 
is a sum of monomials m Q +g.^+g.fc< |z m | 2 arising from the terms 
9 + e m . 

Lemma 12.2. Given a function F with Ilj^ ^,F — F then f* = Hn.6,^,tF ° ^ is piecewise 
Toplitz and diagonal. 

Proof. By the previous remarks we may compute explicitly /* as: 
n^(V y Fo$). ]T L(m)\z m \ 2 , 

|m|>9N T l , 

i7ie(w,e,/i,T)-cut 

we have that /* E 7~(n,(Lu,,t) since L(m) is fixed on all the (N, 9, fi, r)-good points of any 
subspace (by Theorem El). □ 



210 



for suitable \z\ 2g , g — 



Lemma 12.3. Given a function F £ T^n ,e' .t) } then TI.n,8,h,tF ° ^ € T{N,6,ii,T) 
Proof. Recalling Definition |8.7| we have 

F= E E F <T ' CT '(A,CTm + cr'n)z I £T n z^' with F°' a> {A,h) € C r<s (N, fj,' ,h) . 

AeH N \m\,\n\>6N^i ,<t,(t'=± 

denoting A = [vi,Pi\£ the apex * means the sum restricted to those n, m which have a cut 
at £ with parameters {9, /i, r) and m has associated space A. 

Composing with the map ^ in (209), since m,n ^ S, we get Il( N t g^ r \F o = 

E E n^(^(A,am+y n ) o * e - i ^w+'' i W''))« l rw B ' . 

AeV.N \m\,\n\>6N T i ,a,a' = ± 

Each coefficient F cr '°' (A, am + a'n) o $ depends on n,m,a,a' only through A, am + 
a n, a, a' , same for aL{m) + a' L(n). □ 

Proposition 12.4. (Quasi— Toplitz) Let K,9,9' , fj,, /j,' be admissible parameters and 

(214) fa' - fj,)K 3 > K , , (9 - 9')K T1 > 2d K T1 2~^ +1 <1. 
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IfF e Ql sar {K,e' ,^'), then f := F o $ e Q T sr {K,9,pL) and 

(215) ll/lls,r,AT,6i,^,A e 2 < ll-^llis^r.iir.e,/*' ■ 

Proof. Consider N > K and suppose that F has no iV, //-low terms. In this case the 
proof is identical to that of Proposition 1 1 1 .5| provided we use the corresponding Lemmata 
of this section. We conclude the proof by noting that H(n,0,h,t) (J^n ,u'F ° <= Tn.9,^,t 
by Lemma [l2.2| Hence in this case the Toplitz defect is zero. □ 

12.5. The final step. In the final step we diagonalize block by block the matrices. Let 
us denote by <E>' this change of variables. Here we need only to remark that, since we 
are making linear transformations among variables Zk which have the same root, any 
monomial in these variables is replaced by a homogeneous sum of monomials in the new 
variables, all of which have the same momentum (with the same roots), so the space 
spanned by (iV, 9, [i, r)-bilinear monomials is preserved. 
It remains to understand what happens to the space 7~ p . 

Consider g(A) := Y? n ,m,<,,<r' 9k,lA <JTn+a ' n > i v HPi]f> I)e i{k ' x) z a z 13 z^' , what we see 
is this, changing the basis in the block of m and n we have a certain linear combination 
which is preserved if we translate m or n so if the block of m has the two bases a, and bi 
and say m = a\ and similarly for n we have that am + a'n is indeed just one of finitely 
many possibilities given <2j , bj . 

Proposition 12.6. The Hamiltonian of the NLS in the final coordinates is quasi-Toplitz 
for the parameters po = (tq = r/4, sq = s/4,6q = C/2,/io = 2c, K > Nq,X = 2e 2 ,0 = 
A s 2). We have the bounds: 

(216) ||**o*'op||J < C^r + e'V- 1 ), 

Proof. By Corollary |11.6| we have that P is quasi-Toplitz with parameters s,r,K,9 = 
C|,/x = c|,2e 2 ,A £ 2). Since 

( 3 -c- 5 -c) N 3>N Q , (|-|)C^>2d, N?2~^<1 

we apply Proposition |12.4| and obtain the desired bounds for $oP with the parameters 
(s/2, r/2, 9 = |C, fi = |c, K > N , 2e 2 , A e z). Then we apply the last change of variables 
and reduce the analyticity radius by |. We obtain the desired result. □ 

12.7. Final conclusions: solutions of the NLS. 

Proposition 12.8. The Hamiltonian of the NLS in the final coordinates is a compatible 
Hamiltonian in the sense of Definition \10.1 and satisfies the hypotheses of Theorem [6] 
provided we choose r = e 1 and e small. 



Proof. The fact that it satisfies the hypotheses of Theorem [6] follows from (216). We need 
to verify all the conditions (Al) — (A4). 

(Al) Non-degeneracy: The map £ — > w(£) is £ i— ► v — 2£ so it is a lipeomorphism from 
O to its image with < 1. We have |w(£) — v| < e 2 since by assumption |£| < e 2 . 

(A2) Asymptotics of normal frequency: For all n € S c we have a decomposition: 
(217) n n (0 = <j(n)(\r(n)\ 2 + 2MO)- 

In our case we start from f2 n (£) = 0. We know that the i? n (£) are in a finite list of analytic 



functions which are homogeneous of degree one in £. As for (116), by homogeneity, we 
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can fix M > 1 so that 2 + 2|tf < M . 
We choose 7 = ae 2 < 2Me 2 . 



2|^|oo < Me 2 . This fixes the parameters M, L. 



(A3) Regularity and Quasi-Toplitz property: The function (ilz, z) — 0. As for P, z) :- 



Y]- &j\zj\ are M-regular, preserve momentum as in (44), are Lipschitz in the parameters 



f. Then P is quasi-Toplitz with parameters (sq, ro, K, 9q, [iq, j/M, A e z) by Proposition 
12.6 with the bound s (216). Moreover we know that the functions are constant of the 
strata Ar, a of { 7.23 hence (fiz, z) := ^ ■ $ 1 | 2 is M-regular, preserve momentum and is 



quasi-Toplitz and for all N > K tq < r < ri/4d we have 11(^,0,^,-7-) J2j^j\ z j\ 2 e T[ 



(A3*) Smallness condition: We first fix K large enough (independently form r ). We 
compute 6, |e| < Ce 3 ^ 1 = Cea^ 1 by (|216[). The condition 



(218) 6<1, LM|e|<l, 7 - 1 ||Jf <Ar) , > ||J<niin(l,A^/4) ) Ke|e|JiH? « 1 



follows from ( 34 ) , (|192[) , with r = £ , provided that — is small enough (note that the third 



condition is trivial since O = 0). 

(AA) Non-degeneracy (Melnikov conditions): For all (k,l) 7^ compatible with momen- 
tum conservation the function (oj, k) + (CI, I) is of the form (v, k) + (V, I) — 2 k^ + 28 
where 9 can be 0, ±i?j±i?fc. From the main result of [T7] we know that all the functions 
J2i + ^ are analytic, homogeneous of degree 1 and different from 0. We deduce thus 
that there exists a non-empty open region Co where a :— inA^ n€ o a | A^,, ; ((w, k)+(Cl, l))\ > 
for |fc| < 16*/n. 

□ 

For compatible Hamitonians we have proved in the previous section a general Theorem 
[6] which insures the existence of KAM-tori, now we can apply this Theorem to the NLS 
and have our final result. 

Theorem 7. Given any set of n generic frequencies S and the corresponding solutions of 
the resonant Hamiltonian depending on n parameters £j we have a Cantor set of positive 
measure in the parameter space for which these solutions persist giving a family of solutions 
of the NLS which are also linearly stable. 

In an open set of this Cantor set of positive measure the solutions are also elliptic. 
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